DETECTABLE LABELED NUCLEOSIDE 
ANALOGS AND METHODS OF USE THEREOF 

1. CROSS-REFERENCE TO RELATED APPLICATION 

This application claims the benefit under 37 C.F.R. § 119(e) of prior 
U.S. Application No. 60/428,484, filed November 22, 2003, which application is 
incorporated by reference herein in its entirety. 

2. FIELD OF THE INVENTION 

The invention relates to detectable labels useful, for example, for the 
detection of nucleotide sequences. Specifically, the invention relates to labeled 
imidazole-PEG compounds, such as nucleosides, nucleotides, and nucleic acids 
incorporating such compounds, and methods utilizing such compounds. The 
invention further relates to kits comprising labeled imidazole-PEG compounds. 

3. BACKGROUND OF THE INVENTION 

Many procedures employed in biomedical research and recombinant 
DNA technology rely on the use of labeled nucleotide or polynucleotide derivatives. 
In order for a modified nucleotide to be generally suitable as a labeled form of a 
naturally occurring nucleotide, several criteria must typically be satisfied. First, the 
modified compound must contain a substituent or probe that is unique (i.e., not 
normally found associated with nucleotides or polynucleotides). Second, the probe 
must react specifically with chemical or biological reagents to provide a sensitive 
detection system. Third, the analog must be a relatively efficient substrate for 
commonly studied nucleic acid enzymes, since numerous practical applications 
require that the analog be enzymatically metabolized (e.g., the analog must function 
as a substrate for nucleic acid polymerases). For this purpose, probe moieties should 
not be placed on ring positions that sterically or otherwise interfere with the normal 
Watson-Crick hydrogen bonding potential of the bases. Substituents at such positions 
can yield compounds that are inactive as polymerase substrates. Fourth, the detection 
system should be capable of interacting with probe substituents incorporated into both 
single stranded and double stranded polynucleotides in order to be compatible with 
nucleic acid hybridization methodologies. Fifth, the physical and biochemical 
properties of polynucleotides containing small numbers of probe substituents should 
not be significantly altered so that current procedures using hybridization probes need 
not be extensively modified. This criterion must be satisfied whether the probe is 
introduced by enzymatic or direct chemical means. Finally, the linkage that attaches 
the probe moiety should withstand all experimental conditions to which normal 
nucleotides and polynucleotides are routinely subjected (e.g., extended hybridization 
times at elevated temperatures, phenol and organic solvent extraction, or 
electrophoresis). 

The specificity and tenacity of the biotin-avidin complex have been used 
in recent years to develop methods for visually localizing specific proteins, lipids, or 
carbohydrates on or within cells (E. A. Bayer and M. Wilchek, Methods of 
Biochemical Analysis, 26, 1, 1980). Chromosomal location of RNA has been 
determined by electron microscopy using a biotinized protein (e.g., cytochrome C) 
chemically crosslinked to RNA as a hybridization probe. The site of hybridization 
was visualized through the binding of avidin-ferritin or avidin-methacrylate spheres 
mediated by the avidin-biotin interaction (Manning et al., 1975, Chromosoma, 53: 
107; Manning, 1977, Biochemistry, 61: 1364; Broker, 1978, Nucleic Acid Res., 5: 
363; and Sodja et al., 1978, Nucleic Acid Res., 5: 383). This approach to the detection 
of polynucleotide sequences, although successful in the specialized cases examined, 
which were highly reiterated sequences, is not of general utility for analysis of 
polynucleotides present in single or low copy number. 

Accordingly, there remains a continuing need for detectable labeled 
compounds that are safe, cost effective, stable, and efficient, and that provide for 
sensitive detection. 

4. SUMMARY OF THE INVENTION 

The invention provides a series of novel nucleotide derivatives that 
contain a label covalently attached to an imidazole compound. In one aspect, the 
invention relates to a class of compounds that are useful as labels for detection of 
nucleotide sequences. In another aspect, the invention relates to oligonucleotides 
comprising a detectable label. The invention further relates to methods of use and 
methods of making the detectable labels of the invention. 

The invention encompasses compounds of formula I: 

[Structure I, not reproduced in this text. The surviving labels of the drawing are 
O, R2, Base, N, (Linker), Label and Sugar.] 

I 

alone or in combination with a counterion thereof or a stable salt, solvate, clathrate or 
mixture thereof, wherein: 

sugar is a ribose, deoxyribose or a ribose sugar analog; 

base is 

wherein the sugar is covalently bonded to N of the base; 

wherein R1 is -H, -NHC(O)NH2, -NH2, -OH, -O(alkyl), alkyl, or CO2H; 
wherein R2 is -H, -NH2, -OH, -O(alkyl), alkyl, CO2H, or -CO2alkyl; 

linker is any linker known to those of skill in the art; and 

label is any label known to those of skill in the art. In one embodiment, 
the linker is (-O-alkyl)n, (NH)n, (C(O)NH)n, (NHC(O)NH)n, (NH-alkyl-O-alkyl-NH)n, 
or (alkyl)n; wherein n is an integer from 1 to 30. In one embodiment, label is a 
colorimetric compound, a chemiluminescent compound, a bioluminescent compound, 
a quencher, a fluorescent compound or a non- or weakly fluorescent compound (each 
compound being a "detectable label"). 

The invention also encompasses oligonucleotides comprising a 
detectable label of formula II: 

[Structure II, not reproduced in this text. The surviving labels of the drawing are 
O, R2, Base, N, (Linker), Label and Sugar.] 

II 

alone or in combination with a counterion thereof or a stable salt, 
solvate, clathrate or mixture thereof, wherein: 

sugar is a ribose, deoxyribose or a ribose sugar analog; 

base is 

wherein the sugar is covalently bonded to N of the base; 

wherein R1 is -H, -NHC(O)NH2, -NH2, -OH, -O(alkyl), alkyl, or CO2H; 

wherein R2 is -H, -NH2, -OH, -O(alkyl), alkyl, CO2H, or -CO2alkyl; 

linker is any linker known to those of skill in the art; and 

label is any label known to those of skill in the art. In one embodiment, 
linker is (alkylO)n, (NH)n, (C(O)NH)n, (NHC(O)NH)n, (NH-alkyl-O-alkyl-NH)n, or 
(alkyl)n; wherein n is an integer from 1 to 30. In one embodiment, label is a 
colorimetric compound, a chemiluminescent compound, a bioluminescent compound, 
a quencher, a fluorescent compound or a non- or weakly fluorescent compound. 

Another embodiment encompasses a method for detecting the presence 
or absence of a target nucleic acid in a test sample, comprising: 

contacting the test sample with a composition comprising an oligo- or 
polynucleotide probe wherein the oligo- or polynucleotide probe comprises a 
detectable label of formula: 

[Structure not reproduced in this text. The surviving labels of the drawing are 
X, Base, O, R2 and H.] 

Base is 

wherein the sugar is covalently bonded to N of the base; 
wherein R1 is -H, -NHC(O)NH2, -NH2, -OH, -O(alkyl), alkyl, or CO2H; 
wherein R2 is -H, -NH2, -OH, -O(alkyl), alkyl, CO2H, or -CO2alkyl; 
each of X, Y, and Z is independently -H, -OH, -O(alkyl), -SH, -SR4, 
-NHR4, -NR4R5, R3O-P(O)(OR3)-O-, R3O-P(O)(OR3)-O-P(O)(OR3)-O-, or 
R3O-P(O)(OR3)-O-P(O)(OR3)-O-P(O)(OR3)-O-; 

wherein R3 is -H or metal; 
R4 is -H or alkyl; 
R5 is -H or alkyl; 
Q is O, S, or NH; 

linker is any linker known to those of skill in the art; 
label is any label known to those of skill in the art; 

wherein the oligonucleotide probe is capable of selectively or 
specifically hybridizing with the target nucleic acid under conditions of moderate 
stringency; 

optionally exposing the sample to light of an excitation wavelength; and 

detecting whether said contacting produces a change in color or 
fluorescence emission of the composition; 

such that a change in color or fluorescence emission of the 
composition indicates that the target nucleic acid is present in the test sample. In one 
embodiment, label is a colorimetric compound, a chemiluminescent compound, a 
bioluminescent compound, a quencher, a fluorescent compound or a non- or weakly 
fluorescent compound. 

Another embodiment encompasses a method for detecting ribonuclease 
activity in a test sample, comprising: 

contacting the test sample with a substrate, thereby creating a test reaction 
mixture, wherein said substrate comprises a nucleic acid molecule comprising: 

i. a cleavage domain comprising a single stranded region, said 
single stranded region comprising at least one internucleotide 
linkage immediately 3' to an adenosine residue, at least one 
internucleotide linkage immediately 3' to a cytosine residue, at 
least one internucleotide linkage immediately 3' to a guanosine 
residue, and at least one internucleotide linkage immediately 3' 
to a uridine residue, and wherein said cleavage domain does not 
comprise a deoxyribonuclease cleavable internucleotide 
linkage; and 

ii. a detectable label of the invention on one side of at least one of 
the internucleotide linkages; 

incubating said test reaction mixture for a time sufficient for cleavage of the 
substrate by a ribonuclease in the sample; and 

determining whether a detectable signal is emitted from the test reaction mixture, 
wherein emission of a detectable signal from the reaction mixture 
indicates that the sample contains ribonuclease activity. 
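
The sequence requirement on the cleavage domain can be illustrated with a minimal 
sketch. It assumes, for illustration only, that the single stranded region is written 
5' to 3' in single-letter RNA code (A, C, G, U) and that a residue has an 
internucleotide linkage immediately 3' to it whenever another residue follows it 
within the region. The function name is hypothetical and not part of the invention. 

```python
def has_required_linkages(single_stranded_region: str) -> bool:
    """Check a 5'->3' RNA string for a linkage 3' to each of A, C, G and U.

    Assumption: every residue except the last one in the region is followed
    by an internucleotide linkage, so the last residue is excluded.
    """
    seq = single_stranded_region.upper()
    residues_with_3prime_linkage = set(seq[:-1])
    return {"A", "C", "G", "U"} <= residues_with_3prime_linkage
```

Under these assumptions, "ACGUA" satisfies the requirement, whereas "ACGU" does 
not, because no linkage follows the terminal U within the region. 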

Another embodiment encompasses a method for detecting ribonuclease 
activity in a test sample, comprising: 

contacting the test sample with a substrate, thereby creating a test reaction 
mixture, wherein said substrate comprises a nucleic acid molecule comprising: 

i. a cleavage domain comprising a single stranded region, said 
single stranded region comprising at least one internucleotide 
linkage immediately 3' to an adenosine residue, at least one 
internucleotide linkage immediately 3' to a cytosine residue, at 
least one internucleotide linkage immediately 3' to a guanosine 
residue, and at least one internucleotide linkage immediately 3' 
to a uridine residue, and wherein said cleavage domain does not 
comprise a deoxyribonuclease cleavable internucleotide 
linkage; and 

ii. a detectable label on one side of at least one of the 
internucleotide linkages; 

incubating said test reaction mixture for a time sufficient for cleavage of the 
substrate by a ribonuclease activity in the test sample; 

determining whether a detectable signal is emitted from the test reaction mixture; 

contacting a control sample with the substrate, said control sample comprising a 
predetermined amount of ribonuclease, thereby creating a control reaction mixture; 

incubating said control reaction mixture for a time sufficient for cleavage of the 
substrate by a ribonuclease in the control sample; 

determining whether a detectable signal is emitted from the control reaction 
mixture; 

wherein detection of a greater signal in the test reaction mixture than in 
the control reaction mixture indicates that the test sample contains greater 
ribonuclease activity than in the control sample, and wherein detection of a lesser 
signal in the test reaction mixture than in the control reaction mixture indicates that 
the test sample contains less ribonuclease activity than in the control sample, and 
wherein detection of a signal in the test reaction mixture equal to that in the control 
reaction mixture indicates that the test sample contains the same amount of 
ribonuclease activity as is present in the control sample. 
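
The comparison between the test and control reaction mixtures can be sketched as 
follows. This is an illustrative sketch only: the function name and the relative 
tolerance used to treat two measured signals as "equal" are assumptions, since the 
text above does not specify how equality of signals is judged. 

```python
import math


def compare_activity(test_signal: float, control_signal: float,
                     rel_tol: float = 0.05) -> str:
    """Classify test-sample nuclease activity relative to the control.

    rel_tol is a hypothetical tolerance; signals within it count as equal.
    """
    if math.isclose(test_signal, control_signal, rel_tol=rel_tol):
        return "same"
    return "greater" if test_signal > control_signal else "less"
```

For example, with the assumed 5% tolerance, a test signal of 120 against a control 
signal of 100 is classified as greater activity, and 101 against 100 as the same. 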

Another embodiment encompasses methods of making a compound of 
formula I: 

[Structure I, not reproduced in this text. The surviving labels of the drawing are 
O, R2, Base, N, (Linker), Label and Sugar.] 

I 

alone or in combination with a counterion thereof or a stable salt, 
solvate, clathrate or mixture thereof, wherein: 

sugar is a ribose, deoxyribose or a ribose sugar analog; 

base is 

wherein the sugar is covalently bonded to N of the base; 
wherein R1 is -H, -NHC(O)NH2, -NH2, -OH, -O(alkyl), alkyl, or CO2H; 
wherein R2 is -H, -NH2, -OH, -O(alkyl), alkyl, CO2H, or -CO2alkyl; 
linker is any linker known to those of skill in the art; and 

label is any label known to those of skill in the art. In one embodiment, 
linker is (alkylO)n, (NH)n, (C(O)NH)n, (NHC(O)NH)n, (NH-alkyl-O-alkyl-NH)n, or 
(alkyl)n; wherein n is an integer from 1 to 30. In one embodiment, label is a 
colorimetric compound, a chemiluminescent compound, a bioluminescent compound, 
a quencher, a fluorescent compound or a non- or weakly fluorescent compound. 

Another embodiment encompasses a method of detecting endonuclease 
activity in a test sample comprising: 

contacting the test sample with a substrate, thereby creating a test reaction 
mixture, wherein said substrate comprises a nucleic acid molecule comprising 
a detectable label and a quencher; 

incubating the test reaction mixture for a time sufficient for cleavage of the 
substrate by endonuclease activity in the test sample to separate the detectable 
label from the quencher; 

determining whether a detectable signal is emitted from the test reaction 
mixture; 

contacting a control sample with the substrate, the control sample comprising 
a predetermined amount of endonuclease, thereby creating a control reaction 
mixture; 

incubating the control reaction mixture for a time sufficient for cleavage of the 
substrate by an endonuclease in the control sample; 

determining whether a detectable signal is emitted from the control reaction 
mixture; 

wherein detection of a greater signal in the test reaction mixture than in 
the control reaction mixture indicates that the test sample contains greater 
endonuclease activity than in the control sample, and wherein detection of a lesser 
signal in the test reaction mixture than in the control reaction mixture indicates that 
the test sample contains less endonuclease activity than in the control sample, and 
wherein detection of a signal in the test reaction mixture equal to that in the control 
reaction mixture indicates that the test sample contains the same amount of 
endonuclease activity as is present in the control sample. 

Another embodiment encompasses a method of making a labeled 
polynucleotide comprising: 

contacting a double or single stranded nucleotide with a terminal deoxynucleotidyl 
transferase under conditions suitable to afford a 3' hydroxyl terminus; 

contacting with a detectable label of the invention, under conditions suitable for 
the attachment of a compound to a 3' hydroxyl terminus of the double or single 
stranded nucleotide to afford a labeled nucleotide; and 

detecting the labeled nucleotide. 

Another embodiment encompasses a method for detecting ribonuclease 
activity in a test sample, comprising: 

contacting the test sample with a substrate, thereby creating a test reaction 
mixture, wherein the substrate comprises a nucleic acid molecule comprising: 

i. a cleavage domain comprising a single stranded region, the 
single stranded region comprising at least one internucleotide 
linkage immediately 3' to an adenosine residue, at least one 
internucleotide linkage immediately 3' to a cytosine residue, at 
least one internucleotide linkage immediately 3' to a guanosine 
residue, and at least one internucleotide linkage immediately 3' 
to a uridine residue, and wherein the cleavage domain does not 
comprise a deoxyribonuclease-cleavable internucleotide 
linkage; and 

ii. a detectable label on one side of the internucleotide linkages; 

incubating the test reaction mixture for a time sufficient for cleavage of the 
substrate by a ribonuclease activity in the test sample; 

determining whether a detectable signal is emitted from the test reaction mixture; 

contacting a control sample with the substrate, the control sample comprising a 
predetermined amount of ribonuclease, thereby creating a control reaction mixture; 

incubating the control reaction mixture for a time sufficient for cleavage of the 
substrate by a ribonuclease in the control sample; 

determining whether a detectable signal is emitted from the control reaction 
mixture; 

wherein detection of a greater signal in the test reaction mixture than in 
the control reaction mixture indicates that the test sample contains greater 
ribonuclease activity than in the control sample, and wherein detection of a lesser 
signal in the test reaction mixture than in the control reaction mixture indicates that 
the test sample contains less ribonuclease activity than in the control sample, and 
wherein detection of a signal in the test reaction mixture equal to that in the control 
reaction mixture indicates that the test sample contains the same. 



4.1 DEFINITIONS 

As used herein and unless otherwise indicated, the term "alkyl group" 
means a saturated, monovalent, unbranched or branched hydrocarbon chain. 
Examples of alkyl groups include, but are not limited to, C1-C6 alkyl groups, such as 
methyl, ethyl, propyl, isopropyl, 2-methyl-1-propyl, 2-methyl-2-propyl, 
2-methyl-1-butyl, 3-methyl-1-butyl, 2-methyl-3-butyl, 2,2-dimethyl-1-propyl, 
2-methyl-1-pentyl, 3-methyl-1-pentyl, 4-methyl-1-pentyl, 2-methyl-2-pentyl, 
3-methyl-2-pentyl, 4-methyl-2-pentyl, 2,2-dimethyl-1-butyl, 3,3-dimethyl-1-butyl, 
2-ethyl-1-butyl, butyl, isobutyl, t-butyl, pentyl, isopentyl, neopentyl, and hexyl, and 
longer alkyl groups, such as heptyl and octyl. An alkyl group can be unsubstituted or 
substituted with one or two suitable substituents. 

As used herein and unless otherwise indicated, the term -O-alkyl (or 
alkyl-O-) means an "alkoxy group," wherein alkyl is as defined above. An alkoxy 
group can be unsubstituted or substituted with one or more suitable substituents. The 
alkyl chain of an alkoxy group can be, for example, from 1 to 6 carbon atoms in 
length. 

As used herein and unless otherwise indicated, the term "metal" refers 
to a group I or group II metal, including but not limited to, Li+, Na+, Ca2+, or Mg2+. 

As used herein and unless otherwise indicated, the term "substituted" 
refers to a radical in which one or more hydrogen atoms are each independently 
replaced with the same or different substituent(s). Typical substituents include, but 
are not limited to, alkyl, alkoxy, -X, -R, -O⁻, =O, -OR, -O-OR, -SR, -S⁻, =S, -NRR, 
=NR, -CX3, -CN, -OCN, -SCN, -NCO, -NCS, -NHCHO, -NHCOC1-C4alkyl, 
-NHCOCH3, -NHCOCH2Cl, -NHCOCHCl2, -NHCOCCl3, -NHCOCF3, 
-NHCOCH2C6H4-o-NO2, -NHCOCH2OC6H4-o-NO2, where each X is independently a 
halogen (preferably -F, -Cl or -Br) and each R is independently -H, alkyl, alkenyl, 
alkynyl, aryl, arylalkyl, arylheteroalkyl, heteroaryl, heteroarylalkyl or heteroaryl 
heteroalkyl, as defined herein. 

As used herein and unless otherwise indicated, the terms "linking 
group" and "linker" are used interchangeably and refer to a moiety of a detectable 
label capable of covalently bonding a base with a label, e.g., forming a "linkage" that 
connects the nucleoside, nucleotide or nucleic acid to the label. Examples of linkers 
include, but are not limited to, O, S, or NH. Optionally, the linking group or linker is 
a covalent bond (i.e., the label is covalently bonded to the base). 

As used herein and unless otherwise indicated, the term "counterion" 
refers to an ion that is stable and synthetically accessible. Examples of counterions 
include, but are not limited to, chloride, bromide, iodide, sulfate, benzene sulfonate, 
p-toluenesulfonate, p-bromobenzenesulfonate, methanesulfonate, 
trifluoromethanesulfonate, phosphate, perchlorate, tetrafluoroborate, 
tetraphenylboride, nitrate and anions of aromatic or aliphatic carboxylic acids. 
Preferred counterions are chloride, iodide, perchlorate and the sulfonates listed above. 

As used herein, a substance that is "biologically compatible" is not 
toxic as used, and does not have a substantially deleterious effect on biomolecules. 

As used herein and unless otherwise indicated, the terms "biotin" or 
"biotin derivative" refer respectively to biotin or a compound wherein one of the 
atoms of biotin is replaced with another suitable atom; examples of suitable atoms 
include, but are not limited to, -O, -N, -P, or -S. Examples of biotin or a biotin 
derivative include, but are not limited to, biotin or iminobiotin, which have the 
following respective structures: 

[Structures of biotin and iminobiotin, not reproduced in this text.] 

As used herein and unless otherwise indicated, the term "nucleobase" 
refers to adenine, cytosine, guanine, thymine or uracil. 

As used herein and unless otherwise indicated, the term "nucleobase 
analog" refers to a substituted or unsubstituted nitrogen-containing parent 
heteroaromatic ring that is capable of forming Watson-Crick hydrogen bonds with a 
complementary nucleobase or nucleobase analog. Preferably, the nucleobase analog 
is a purine, deazapurine or pyrimidine. Exemplary nucleobase analogs include, but 
are not limited to, 7-deazaadenine, inosine, nebularine, nitropyrrole, nitroindole, 
2-aminopurine, 2,6-diaminopurine, hypoxanthine, pseudouridine, 5-propynylcytidine, 
isocytidine, isoguanine, 7-deazaguanine, 2-thiopyrimidine, 6-thioguanine, 
4-thiothymine, 4-thiouracil, O6-methylguanine, N6-methyladenine, O4-methylthymine, 
5,6-dihydrothymine, 5,6-dihydrouracil, 4-methylindole, ethenoadenine, etc. 
Additional exemplary nucleobase analogs can be found in Fasman, 1989, Practical 
Handbook of Biochemistry and Molecular Biology, pp. 385-394, CRC Press, Boca 
Raton, FL, and the references cited therein, incorporated herein by reference. 

As used herein and unless otherwise indicated, the term "nucleoside" 
refers to a compound consisting of a nucleobase covalently linked to the C1' carbon of 
a substituted or unsubstituted ribose sugar. Typical substituted ribose sugars include, 
but are not limited to, those in which one or more of the carbon atoms, preferably one 
and most preferably the 3' carbon atom, is substituted with one or more of the same or 
different -R, -OR, -NRR or halogen groups, where each R is independently -H, (C1-C6) 
alkyl or (C5-C14) aryl. Particularly preferred ribose sugars are ribose, 
2'-deoxyribose, 2',3'-dideoxyribose, 3'-haloribose, 3'-fluororibose, 3'-chlororibose, 
3'-alkylribose, etc. When the nucleobase is A or G, the ribose sugar is attached to the 
N9 position of the nucleobase. Where the nucleobase is C, T or U, the pentose sugar is 
attached to the N1 position of the nucleobase (see, e.g., Kornberg and Baker, 1992, 
DNA Replication, 2nd Ed., Freeman, San Francisco). 

As used herein and unless otherwise indicated, the term "nucleoside 
analog" refers to a nucleoside in which the nucleobase, the ribose sugar, or both, are 
replaced with their respective analogs. Exemplary nucleobase analogs are those 
previously defined. Exemplary ribose sugar analogs include, but are not limited to, 
substituted or unsubstituted furanoses having more or fewer than 5 ring atoms, e.g., 
erythroses and hexoses and substituted or unsubstituted 3-6 carbon acyclic sugars. 
Typical substituted furanoses and acyclic sugars are those in which one or more of the 
carbon atoms are substituted with one or more of the same or different -R, -OR, 
-NRR or halogen groups, where each R is independently -H, (C1-C6) alkyl or (C5-C14) 
aryl. 

As used herein and unless otherwise indicated, the term "nucleotide" 
refers to a nucleoside in which one or more, typically one, of the ribose carbons is 
substituted with a phosphate ester having the formula: 

    -O-[P(=O)(O⁻)-O]a-P(=O)(O⁻)-OH 

where a is an integer from 0 to 4. Preferably, a is 2 and the phosphate ester is 
attached to the 3' or 5' carbon of the ribose, e.g., ribose 3'-triphosphate, 
2'-deoxyribose 3'-triphosphate, ribose 5'-triphosphate, 2'-deoxyribose 
5'-triphosphate, 3'-haloribose 5'-triphosphate, 3'-alkylribose 5'-triphosphate, 
2',3'-dideoxyribose 5'-triphosphate, etc. 
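
The relationship between the index a and the phosphate chain length can be sketched 
as below. The sketch assumes that the formula consists of a bracketed phosphate unit 
repeated a times followed by one terminal phosphate, which is consistent with the 
stated preference of a = 2 for the triphosphate examples. The function name is 
hypothetical. 

```python
def phosphate_groups(a: int) -> int:
    """Number of phosphate groups in the ester for a given index a.

    Assumes a repeated units plus one terminal phosphate, so the count is
    a + 1; the definition restricts a to the integers 0 through 4.
    """
    if not isinstance(a, int) or not 0 <= a <= 4:
        raise ValueError("a must be an integer from 0 to 4")
    return a + 1
```

Under this reading, a = 0 corresponds to a monophosphate and a = 2 to a 
triphosphate. 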

As used herein and unless otherwise indicated, the term "nucleotide 
derivative" refers to a nucleotide in which the nucleobase, the ribose sugar and/or one 
or more of the phosphate esters is replaced with its respective analog. Exemplary 
nucleobase and ribose sugar analogs are those previously described in conjunction 
with nucleoside analogs. Exemplary phosphate ester analogs include, but are not 
limited to, alkylphosphonates, methylphosphonates, phosphoramidates, 
phosphotriesters, phosphorothioates, phosphorodithioates, phosphoroselenoates, 
phosphorodiselenoates, phosphoroanilothioates, phosphoroanilidates, 
phosphoroamidates, boronophosphates, peptide nucleic acid (PNA) monomers, etc., 
including any associated counterions, if present. 

As used herein and unless otherwise indicated, the term "protecting 
group" means a group that is reversibly attached to a hydroxyl or amine moiety that 
renders the hydroxyl or amine moiety unreactive during a subsequent reaction(s) and 
that can be selectively cleaved to regenerate the hydroxyl or amine moiety once its 
protecting purpose has been served. Examples of protecting groups are found in 
Greene, T.W., Protective Groups in Organic Synthesis, 3rd edition (1999), 
incorporated herein by reference. In one embodiment, the protecting group is stable 
in a basic reaction medium, but can be cleaved by acid. Examples of base-stable, 
acid-labile protecting groups suitable for use with the invention include, but are not 
limited to, ethers, such as methyl, methoxymethyl, methylthiomethyl, 
methoxyethoxymethyl, bis(2-chloroethoxy)methyl, tetrahydropyranyl, 
tetrahydrothiopyranyl, tetrahydrofuranyl, tetrahydrothiofuranyl, 1-ethoxyethyl, 
1-methyl-1-methoxyethyl, t-butyl, allyl, benzyl, o-nitrobenzyl, triphenylmethyl, 
α-naphthyldiphenylmethyl, p-methoxyphenyldiphenylmethyl, 
9-(9-phenyl-10-oxo)anthranyl, trimethylsilyl, isopropyldimethylsilyl, 
t-butyldimethylsilyl, t-butyldiphenylsilyl, tribenzylsilyl, and triisopropylsilyl; and 
esters, such as pivaloate, adamantoate, and 2,4,6-trimethylbenzoate. 

As used herein and unless otherwise indicated, the term "salt(s)" 
includes, but is not limited to, salts of acidic or basic groups that may be present in the 
compounds of the invention. Compounds that are basic in nature are capable of 
forming a wide variety of salts with various inorganic and organic acids. The acids 
that may be used to prepare acceptable acid addition salts of such basic compounds 
are those that form nontoxic acid addition salts, i.e., salts containing 
pharmacologically acceptable anions, including, but not limited to, sulfuric, citric, 
maleic, acetic, oxalic, hydrochloride, hydrobromide, hydroiodide, nitrate, sulfate, 
bisulfate, phosphate, acid phosphate, isonicotinate, acetate, lactate, salicylate, citrate, 
acid citrate, tartrate, oleate, tannate, pantothenate, bitartrate, ascorbate, succinate, 
maleate, gentisinate, fumarate, gluconate, glucaronate, saccharate, formate, benzoate, 
glutamate, methanesulfonate, ethanesulfonate, benzenesulfonate, p-toluenesulfonate 
and pamoate (i.e., 1,1'-methylene-bis-(2-hydroxy-3-naphthoate)) salts. Compounds of 
the invention that include an amino moiety also can form acceptable salts with various 
amino acids, in addition to the acids mentioned above. Compounds of the invention 
that are acidic in nature are capable of forming base salts with various 
pharmacologically acceptable cations. Examples of such salts include alkali metal or 
alkaline earth metal salts and, particularly, calcium, magnesium, sodium, lithium, 
zinc, potassium, and iron salts. 

As used herein and unless otherwise indicated, the term "solvate" 
means a compound of the invention or a salt thereof, that further includes a 
stoichiometric or non-stoichiometric amount of a solvent bound by non-covalent 
intermolecular forces. Preferred solvents are volatile and non-toxic. The term solvate 
includes hydrates. Hydrate means a compound of the invention or a salt thereof, that 
further includes a stoichiometric or non-stoichiometric amount of water bound by 
non-covalent intermolecular forces. 

As used herein and unless otherwise indicated, the term "clathrate" 
means a compound of the invention or a salt thereof in the form of a crystal lattice 
that contains spaces (e.g., channels) that have a guest molecule (e.g., a solvent or 
water) trapped within. 

As used herein and unless otherwise indicated, the term "nucleoside or 
nucleotide" refers to nucleosides and/or nucleotides and/or mixtures thereof. 

"Nucleoside analog" refers to a nucleoside in which the nucleobase, 
the ribose sugar, or both, are replaced with their respective analogs. Exemplary 
nucleobase analogs are those previously defined. Exemplary ribose sugar analogs 
include, but are not limited to, substituted or unsubstituted furanoses having more or 
fewer than 5 ring atoms, e.g., erythroses and hexoses and substituted or unsubstituted 
3-6 carbon acyclic sugars. Typical substituted furanoses and acyclic sugars are those 
in which one or more of the carbon atoms are substituted with one or more of the 
same or different -R, -OR, -NRR or halogen groups, where each R is independently 
-H, C1-C6 alkyl or C5-C14 aryl. 

As used herein and unless otherwise indicated, the term "nucleotide" 
refers to a nucleoside in which one or more, typically one, of the ribose carbons is 
substituted with a phosphate ester having the formula: 

    -O-[P(=O)(O⁻)-O]a-P(=O)(O⁻)-OH 

where a is an integer from 0 to 4. Preferably, a is 2 and the phosphate ester is 
attached to the 3'- or 5'-carbon of the ribose, e.g., ribose 3'-triphosphate, 
2'-deoxyribose 3'-triphosphate, ribose 5'-triphosphate, 2'-deoxyribose 
5'-triphosphate, 3'-haloribose 5'-triphosphate, 3'-alkylribose 5'-triphosphate, 
2',3'-dideoxyribose 5'-triphosphate, etc. 

As used herein and unless otherwise indicated, the term "nucleotide 
analog" refers to a nucleotide in which the nucleobase, the ribose sugar and/or one or 
more of the phosphate esters is replaced with its respective analog. Exemplary 
nucleobase and ribose sugar analogs are those previously described in conjunction 
with nucleoside analogs. Exemplary phosphate ester analogs include, but are not 
limited to, alkylphosphonates, methylphosphonates, phosphoramidates, 
phosphotriesters, phosphorothioates, phosphorodithioates, phosphoroselenoates, 
phosphorodiselenoates, phosphoroanilothioates, phosphoroanilidates, 
phosphoroamidates, boronophosphates, peptide nucleic acid (PNA) monomers, etc., 
including any associated counterions, if present. 

As used herein and unless otherwise indicated, the term "nucleic acid" 
refers to a linear polymeric chain of nucleoside monomer units that are covalently 
connected to one another by phosphate ester internucleotide linkages. Unless stated 
otherwise, "nucleic acid" as used herein includes polymers of any length, including 
oligonucleotides, polynucleotides and nucleic acids as those terms are commonly used 
in the art. Thus, nucleic acids according to the invention can range in size from a few 
monomer units (e.g., 4 to 40), to several hundreds of monomer units, to several 
thousands of monomer units, or even more monomer units. Such nucleic acids can 
also be described herein in terms of their functions, such as primer or probe. 
Whenever a nucleic acid is represented by a sequence of letters, e.g., "ATGCCTG," it 
will be understood that the sequence is presented in the 5'→3' direction. Unless 
otherwise indicated, nucleic acids whose sequences are described herein are 
2'-deoxyribonucleic acids. 

As used herein and unless otherwise indicated, the term "nucleic acid
analog" refers to a nucleic acid in which at least one nucleoside monomer unit is a
"nucleoside analog" and/or at least one phosphate ester internucleotide linkage is a
phosphate ester analog, as defined above under "nucleotide analog." Preferred classes
of nucleic acid analogs are those in which the sugar and internucleotide linkages are
replaced with an uncharged, neutral amide, such as a morpholino carbamate and
peptide nucleic acids ("PNA"). Preferred PNAs are those having a N-(2-aminoethyl)
glycine amide backbone (see, e.g., Nielsen et al., 1991, Science 254:1497-1500).

As used herein and unless otherwise indicated, the term "label" refers
to a detectable molecule or atom attached, covalently or non-covalently, to a
nucleoside or nucleotide, nucleoside or nucleotide analog, nucleic acid, nucleic acid
analog or terminator. In one embodiment, a nucleoside or nucleotide, nucleoside or
nucleotide analog, nucleic acid, nucleic acid analog or terminator has a detectable
label covalently attached to the nucleobase. The term "label" can also refer to a
molecule that modulates detection of another detectable label, such as a quencher. As
used herein, the term "detectable label" is intended to include not only a molecule or
label which is "directly" detected (e.g., a chromogen or a fluorophore) but also a
moiety (e.g., biotin) which is "indirectly" detected by its binding to a second, third, or
greater binding partner (e.g., avidin or streptavidin), one of which carries a "direct"
label. In one embodiment, the oligonucleotide is biotin-modified, and is detectable
using a detection system based on avidin or streptavidin which binds with high
affinity to biotin. The avidin or streptavidin can be conjugated to an enzyme, the
presence of which is detected by allowing the enzyme to react with a chromogenic
substrate and measuring the color developed (i.e., "colorimetric compound").
Non-limiting examples of useful enzymes in the methods of the present invention are
horseradish peroxidase (HRP), alkaline phosphatase, glucose-6-phosphate
dehydrogenase, malate dehydrogenase, staphylococcal nuclease, delta-5-steroid
isomerase, yeast alcohol dehydrogenase, α-glycerophosphate dehydrogenase, triose
phosphate isomerase, asparaginase, glucose oxidase, β-galactosidase, ribonuclease,
urease, catalase, glucoamylase and acetylcholinesterase.

Other examples of labels include a fluorescent compound, which,
when exposed to light of the proper wavelength, becomes detectable due to
fluorescence and is detected and/or measured by microscopy or fluorometry.
Commonly used fluorescent labeling compounds include fluorescein isothiocyanate,
rhodamine, phycoerythrin, phycocyanin, allophycocyanin, o-phthalaldehyde and
fluorescamine. The detectable label can be a fluorescence-emitting metal such as
152Eu, or others of the lanthanide series which can be attached to the oligonucleotide
using metal chelating groups, such as diethylenetriaminepentaacetic acid or
ethylenediaminetetraacetic acid.

The label can be a chemiluminescent compound, the presence of which
is detected by measuring luminescence that arises during the course of a chemical
reaction. Examples of useful chemiluminescent labeling compounds are luminol,
isoluminol, theromatic acridinium ester, imidazole, acridinium salt and oxalate ester.
Likewise, a bioluminescent compound can be used to label the oligonucleotide and is
detected by measuring luminescence. In this case, a catalytic protein increases the
efficiency of the chemiluminescence reaction. Examples of useful bioluminescent
labeling compounds include luciferin, luciferase and aequorin.

As used herein and unless otherwise indicated, the term "reporter dye"
refers to a compound which, when exposed to light, emits energy in the form of
fluorescence. "The chromophore of a reporter dye" is the network of atoms of the
reporter dye that, when exposed to light, emits radiation at a level that is detectable by
conventional spectroscopic means.

As used herein and unless otherwise indicated, the term "non-fluorescent"
refers to a compound that, when exposed to radiation, does not emit
radiation at a level that is detectable by conventional spectroscopic means.

As used herein and unless otherwise indicated, the term "weakly
fluorescent" refers to a compound that, when exposed to radiation, emits radiation at a
low level that is detectable by conventional spectroscopic means.

As used herein and unless otherwise indicated, the term "light" refers 
to electromagnetic energy having a wavelength which causes a reporter dye to 
fluoresce, wherein that wavelength may be in the range of 190-800 nm. 

As used herein and unless otherwise indicated, the term "specific"
refers to a nucleic acid used in a reaction, such as a probe used in a hybridization
reaction, a primer used in a PCR, or a nucleic acid present in a composition, that
hybridizes only with the intended target but not to other nucleic acid molecules in the
test sample in the normal testing environment.

As used herein and unless otherwise indicated, the term "selective"
refers to a nucleic acid used in a reaction, such as a probe used in a hybridization
reaction, a primer used in a PCR, or a nucleic acid present in a pharmaceutical
preparation, that hybridizes with the intended target more frequently, more rapidly, or
with greater duration than it does with other nucleic acids in the test sample in the
normal testing environment.

As used herein, the term "hybridizes under stringent conditions"
describes conditions for hybridization and washing under which nucleotide sequences
at least 60% (65%, 70%, or 75% or greater) identical to each other typically remain
hybridized to each other. Stringent conditions depend on the nature of the nucleic
acid (e.g., length, GC content, etc.) and the method itself (hybridization, amplification,
etc.). Such methods are well known in the art and can be found in Current Protocols
in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. In one
embodiment, stringent hybridization conditions are hybridization at 6X sodium
chloride/sodium citrate (SSC) at about 45° C, followed by one or more washes in
0.1X SSC, 0.2% SDS at about [temperature illegible in source]° C. In another
embodiment, stringent hybridization conditions are hybridization in 6X SSC at about
45° C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 50-65° C (i.e., one
or more washes at 50° C, 55° C, 60° C or 65° C). It is understood that the nucleic
acids of the invention do not include nucleic acid molecules that hybridize under
these conditions solely to a nucleotide sequence consisting of only A or T nucleotides.

For example, stringent hybridization of an oligonucleotide of approximately 15-40
bases to a complementary sequence in the polymerase chain reaction (PCR) can be
performed under the following conditions: a salt concentration of 50 mM KCl, a
buffer concentration of 10 mM Tris HCl, a Mg2+ concentration of 1.5 mM, a pH of
7-7.5 and an annealing temperature of 55-60° C. Moderate stringency hybridization
of an oligonucleotide of approximately 15-40 bases to a complementary sequence in
the polymerase chain reaction (PCR) can be performed under the following
conditions: a salt concentration of 50 mM KCl, a buffer concentration of 10 mM Tris
HCl, a Mg2+ concentration of 1.5 mM, a pH of 7-7.5 and an annealing temperature of
48-54° C. Low stringency hybridization of an oligonucleotide of approximately 15-40
bases to a complementary sequence in the polymerase chain reaction (PCR) can be
performed under the following conditions: a salt concentration of 50 mM KCl, a
buffer concentration of 10 mM Tris HCl, a Mg2+ concentration of 1.5 mM, a pH of
7-7.5 and an annealing temperature of 37-47° C.
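
The three PCR stringency classes above differ only in annealing temperature; the salt, buffer, Mg2+ and pH values are the same in each. As an illustrative sketch only, the quoted ranges can be written as a lookup. The names STRINGENCY_RANGES and classify_annealing are hypothetical and do not come from the specification, and temperatures that fall between the quoted ranges are deliberately left unclassified.

```python
from typing import Optional

# Annealing-temperature ranges (degrees C) quoted in the text for PCR with
# oligonucleotides of approximately 15-40 bases. Names are illustrative.
STRINGENCY_RANGES = {
    "stringent": (55.0, 60.0),
    "moderate": (48.0, 54.0),
    "low": (37.0, 47.0),
}


def classify_annealing(temp_c: float) -> Optional[str]:
    """Return the stringency class whose quoted range contains temp_c.

    Returns None when temp_c lies outside every quoted range, including
    the gaps between ranges (for example 54.5 degrees C).
    """
    for name, (low, high) in STRINGENCY_RANGES.items():
        if low <= temp_c <= high:
            return name
    return None
```

For instance, under these assumptions an annealing temperature of 58° C falls in the stringent range and 50° C in the moderate range.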

As used herein and unless otherwise indicated, the term
"stereomerically pure" refers to a composition that comprises one stereoisomer of a
compound and is substantially free of other stereoisomers of that compound. For
example, a stereomerically pure composition of a compound having one chiral center
will be substantially free of the opposite enantiomer of the compound. A
stereomerically pure composition of a compound having two chiral centers will be
substantially free of other diastereomers of the compound. A typical stereomerically
pure compound comprises greater than about 80% by weight of one stereoisomer of the
compound and less than about 20% by weight of other stereoisomers of the compound,
more preferably greater than about 90% by weight of one stereoisomer of the
compound and less than about 10% by weight of the other stereoisomers of the
compound, even more preferably greater than about 95% by weight of one
stereoisomer of the compound and less than about 5% by weight of the other
stereoisomers of the compound, and most preferably greater than about 97% by
weight of one stereoisomer of the compound and less than about 3% by weight of the
other stereoisomers of the compound.
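
The weight-percent tiers in this definition can be summarized as a simple threshold check. The following is a minimal sketch, assuming that "greater than about" is read as a strict inequality on the weight percent of the major stereoisomer; the function name purity_tier and the returned tier labels are hypothetical.

```python
def purity_tier(major_weight_percent: float) -> str:
    """Map the weight percent of the major stereoisomer to the tier in the text.

    Assumption: "greater than about N%" is treated as a strict ">" comparison.
    """
    if not 0.0 <= major_weight_percent <= 100.0:
        raise ValueError("weight percent must be between 0 and 100")
    if major_weight_percent > 97.0:
        return "most preferred"
    if major_weight_percent > 95.0:
        return "even more preferred"
    if major_weight_percent > 90.0:
        return "more preferred"
    if major_weight_percent > 80.0:
        return "typical"
    return "below threshold"
```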

As used herein and unless otherwise indicated, the term "substantially
free" refers to a composition that comprises one compound and is free of detectable or
significant quantities of other compounds. A typical substantially free composition
comprises greater than about 80% by weight of the desired compound and less than
about 20% by weight of one or more other compounds, more preferably greater than
about 90% by weight of the desired compound and less than about 10% by weight of
one or more other compounds, even more preferably greater than about 95% by
weight of the desired compound and less than about 5% by weight of one or more
other compounds, and most preferably greater than about 97% by weight of the
desired compound and less than about 3% by weight of one or more other
compounds.

4.2 ABBREVIATIONS 

The abbreviations used throughout the specification to refer to certain
nucleobases and nucleosides or nucleotides are those commonly employed in the art
and are as follows:

Expression                                        Abbreviation

adenine                                           A
cytosine                                          C
guanine                                           G
thymine                                           T
uracil                                            U
ribonucleoside 5'-triphosphate                    NTP
adenosine 5'-triphosphate                         ATP
cytidine 5'-triphosphate                          CTP
guanosine 5'-triphosphate                         GTP
thymidine 5'-triphosphate                         TTP
uridine 5'-triphosphate                           UTP
2'-deoxyribonucleoside 5'-triphosphate            dNTP
2'-deoxyriboadenosine 5'-triphosphate             dATP
2'-deoxyribocytidine 5'-triphosphate              dCTP
2'-deoxyriboguanosine 5'-triphosphate             dGTP
2'-deoxyribothymidine 5'-triphosphate             dTTP
2'-deoxyribouridine 5'-triphosphate               dUTP
2',3'-dideoxyribonucleoside 5'-triphosphate       ddNTP
2',3'-dideoxyriboadenosine 5'-triphosphate        ddATP
2',3'-dideoxyribocytidine 5'-triphosphate         ddCTP
2',3'-dideoxyriboguanosine 5'-triphosphate        ddGTP
2',3'-dideoxyribothymidine 5'-triphosphate        ddTTP
2',3'-dideoxyribouridine 5'-triphosphate          ddUTP



The following additional abbreviations, which are those commonly employed
in the art, are used throughout the specification:

Term                                                          Abbreviation

2-[4-(2-chloro-4-nitrophenylazo)-N-
  ethylphenylamino]ethanol                                    Disperse Red 13
6-carboxyfluorescein                                          6Fam
N,N,N',N'-tetramethyl-6-carboxyrhodamine                      TAMRA
6-carboxytetramethylrhodamine                                 6Tamra
Indodicarbocyanine 3                                          Cy3
Indodicarbocyanine 5                                          Cy5
Indodicarbocyanine 5.5                                        Cy5.5
3-(ε-carboxypentyl)-3'-ethyl-5,5'-dimethyloxacarbocyanine     CyA
5-[(2-aminoethyl)amino]naphthalene-1-sulfonic acid            EDANS
tetrachlorofluorescein                                        TET
2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein           JOE
6-carboxy-X-rhodamine                                         ROX
1H,5H,11H,15H-Xantheno[2,3,4-ij:5,6,7-i'j']diquinolizin-18-
  ium, 9-[2(or 4)-[[[6-[(2,5-dioxo-1-pyrrolidinyl)oxy]-6-
  oxohexyl]amino]sulfonyl]-4(or 2)-sulfophenyl]-
  2,3,6,7,12,13,16,17-octahydro-, inner salt                  TR or Texas Red
N,N'-dimethyl-N,N'-diphenyl-4-((5-t-
  butoxycarbonylaminopentyl)aminocarbonyl)
  piperidinylsulfonerhodamine                                 QSY™ 7
(4-(4-dimethylaminophenylazo)benzoic acid)                    Dabcyl
polymerase chain reaction                                     PCR

5. DETAILED DESCRIPTION OF THE INVENTION

The invention stems, in part, from the discovery that having a
detectable label directly attached to nucleotide derivatives that are capable of
functioning as enzyme substrates offers considerable versatility, both in the
experimental protocols that can be performed and in the detection methods
(microscopic and non-microscopic) that can be utilized for analysis. Detectable labels
(e.g., biotin nucleotides) can be introduced into polynucleotides, which are in the
process of being synthesized by cells or crude cell extracts, thus making it possible to
detect and/or isolate nascent (growing) polynucleotide chains. Furthermore, enzymes
can be used as reagents for introducing probes, such as biotin nucleotides, into highly
selective or site-specific locations in polynucleotides.

In one embodiment, a detectable label of the invention can be attached
to the 5' or the 3' terminus of an oligo- or polynucleotide, and/or within the sequence.
The detection sensitivity depends, for example, on the number of detectable labels
attached to the oligonucleotide, primer, or probe. The signal intensity and the
detection sensitivity increase with the number of detectable labels attached to the
oligonucleotide. Internal labeling may introduce steric hindrance resulting in a lower
sensitivity and a destabilizing effect on a hybrid formed between the oligo- or
polynucleotide and a target sequence. Therefore, to compromise between hybrid
stability, detection sensitivity and cost, in certain embodiments a typical oligo can
contain about 3 detectable labels of the invention, one at each terminus and one
internal, with a minimum of about 15 bases between the moieties. The synthesis of
nucleotides containing a detectable label of the invention is achieved as indicated in
the examples that follow. Imidazole nucleoside triphosphates containing a detectable
label probe attached to the C-5 carbon atom are practical substrates for a wide variety
of purified nucleic acid polymerases of both prokaryotic and eukaryotic origin,
including but not limited to, DNA polymerase I of E. coli, bacteriophage T4 DNA
polymerase, DNA polymerases α and β from murine (A-9) and human (HeLa) cells,
terminal deoxynucleotidyl transferase, and the DNA polymerase of Herpes simplex
virus. In addition, detectable label ribonucleoside triphosphates may function as
substrates for the RNA polymerases of E. coli and bacteriophage T7. Indeed,
detectable labeled RNA probes can be prepared enzymatically from DNA templates
using E. coli or T7 RNA polymerases or by 3' end-labeling methods using RNA ligase
with compounds such as biotinyl-pCp. With the availability of antibodies to these
analogs, the isolation of nascent transcripts by immunological or affinity procedures is
also feasible.
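
The labeling layout described above (about three labels, one at each terminus and one internal, with a minimum of about 15 bases between the moieties) can be sketched as a small placement check. This is an illustration under stated assumptions, not a method taken from the specification: positions are 0-based indices along the oligonucleotide, "bases between" is read as the count of intervening unlabeled bases, the internal label is placed at the midpoint, and the name three_label_positions is hypothetical.

```python
from typing import List


def three_label_positions(oligo_length: int, min_between: int = 15) -> List[int]:
    """Return 0-based positions for a 5'-terminal, an internal and a 3'-terminal label.

    Assumption: at least min_between unlabeled bases must lie between
    neighbouring labels; the internal label is placed at the midpoint.
    Raises ValueError when the oligonucleotide is too short for that spacing.
    """
    first = 0
    last = oligo_length - 1
    middle = last // 2
    # Count the unlabeled bases strictly between neighbouring labels.
    gap_5prime = middle - first - 1
    gap_3prime = last - middle - 1
    if gap_5prime < min_between or gap_3prime < min_between:
        raise ValueError("oligonucleotide too short for the requested spacing")
    return [first, middle, last]
```

Under these assumptions the shortest oligonucleotide that accepts three labels is 33 bases long, since two gaps of 15 bases plus the three labeled positions are needed.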

In a modification of the invention, analogs of nucleotides such as, but 
not limited to, dUTP and UTP, that contain a biotin molecule covalently bound to the 
ring through an allylamine polyethylene glycol linker arm can be synthesized. These 
biotinyl nucleotides are efficient substrates for a variety of DNA and RNA 
polymerases in vitro. DNA containing low levels of biotin substitution (50 molecules
or less/kilobase) has denaturation, reassociation and hybridization characteristics
which are indistinguishable from those of unsubstituted control DNA.
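
The "50 molecules or less/kilobase" figure is a density, so it scales with the length of the DNA. A minimal sketch of that arithmetic follows; the function names and the default limit argument are illustrative assumptions rather than anything defined in the specification.

```python
def biotin_per_kilobase(n_biotin: int, length_bases: int) -> float:
    """Return the number of biotin molecules per 1000 bases of DNA."""
    if length_bases <= 0:
        raise ValueError("length_bases must be positive")
    return n_biotin * 1000.0 / length_bases


def is_low_substitution(n_biotin: int, length_bases: int, limit: float = 50.0) -> bool:
    """True when the density is at or below the limit (50 per kilobase in the text)."""
    return biotin_per_kilobase(n_biotin, length_bases) <= limit
```

For example, 25 biotin molecules on a 500-base fragment correspond to 50 per kilobase, which sits exactly at the quoted limit.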

As a non-limiting example, biotinylation provides a label or "tag" that
transforms poorly detectable or undetectable molecules into probes that can be
recognized by a detection reagent or an affinity capture matrix. Once labeled with
biotin, the labeled
nucleotide can be used to probe complex protein and nucleic acid blots and arrays. 
This tagged molecule can then be detected with the appropriate avidin conjugate that 
has been labeled with a fluorophore, fluorescent microsphere, enzyme, chromophore, 
magnetic particle or colloidal gold. Biotinylated molecules can also be captured with 
various forms of immobilized streptavidin, such as streptavidin agarose, or stained for 
electron microscopy with reactive gold compounds, such as NANOGOLD® 
(Nanoprobes, Inc., Yaphank, NY) or Alexa Fluor® FluoroNanogold streptavidin 1.4 
nm gold clusters (Molecular Probes, Eugene, OR). 

5.1 DETECTABLE LABELS 

5.1.1 STRUCTURES OF THE COMPOUNDS AND 
LABELS 

In one aspect, the invention is directed to a class of compounds that are
useful as labels for detection of nucleotide sequences.

A first embodiment of the invention encompasses compounds of
formula (I):

[Structure of formula (I): drawing not recoverable from the source text. The
surviving fragments show Base, bearing Sugar, joined through a carbonyl group and a
nitrogen carrying R2 to (Linker)-Label.]

alone or in combination with a counterion thereof or a stable salt,
solvate, clathrate or mixture thereof, wherein:

sugar is a ribose, deoxyribose or a ribose sugar analog;

base is [structure not recoverable from the source text];

wherein sugar is covalently bonded to [position illegible in source] of the base;

wherein R1 is -H, -NHC(O)NH2, -NH2, -OH, -O(alkyl), alkyl, CO2H;
wherein R2 is -H, -NH2, -OH, -O(alkyl), alkyl, CO2H, -CO2alkyl;

linker is any linker known to those of skill in the art; and

label is any label known to those of skill in the art.

In one embodiment, linker is (alkylO)n, (NH)n, (C(O)NH)n, (NHC(O)NH)n,
(NH-alkyl-O-alkyl-NH)n, or (alkyl)n; wherein n is an integer from 1 to 30.

In one embodiment, label is a colorimetric compound, a chemiluminescent
compound, a bioluminescent compound, a quencher, a fluorescent compound or a
non- or weakly fluorescent compound.

A second embodiment of the invention encompasses a compound of
formula II:

[Structure of formula II: drawing not recoverable from the source text.]

alone or in combination with a counterion thereof or a stable salt,
solvate, clathrate or mixture thereof, wherein:

base is [structure not recoverable from the source text];

wherein sugar is covalently bonded to [position illegible in source] of the base;
wherein R1 is -H, -NHC(O)NH2, -NH2, -OH, -O(alkyl), alkyl, CO2H;
wherein R2 is -H, -NH2, -OH, -O(alkyl), alkyl, CO2H, -CO2alkyl;
each of X, Y, and Z is independently -H, -OH, -O(alkyl), -SH, -SR4,
-NHR4, -NR4R5,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal;
R4 is -H or alkyl;
R5 is -H or alkyl;
linker is any linker known to those of skill in the art;
Q is O, S, or NH; and
label is any label known to those of skill in the art.
In one embodiment of the compound of formula II, at least one of X,
Y, and Z is -OH.

In another embodiment of the compound of formula II, each of X, Y,
and Z is -OH.

In another embodiment of the compound of formula II, at least one of
Y and Z is -OH and X is

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-.

In another embodiment of the compound of formula II, X is

R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-.

In another embodiment of the compound of formula II, X is

R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-.

In another embodiment of the compound of formula II, X is

R3O-P(=O)(OR3)-O-.
In another embodiment of the compound of formula II, R1 is -H,
-NHC(O)NH2, -NH2, -OH, -O-alkyl, alkyl, -COOH.

In another embodiment of the compound of formula II, linker is
(alkylO)n, (NH)n, (C(O)NH)n, (NHC(O)NH)n, (NH-alkyl-O-alkyl-NH)n, or (alkyl)n;
wherein n is an integer from 1 to 30.

In another embodiment of the compound of formula II, the label is a
colorimetric compound, a chemiluminescent compound, a bioluminescent compound,
a quencher, a fluorescent compound or a non- or weakly fluorescent compound.

In another embodiment of the compound of formula II, the linker is
(CH2CH2O)n, wherein n is an integer between 1 and 10.

In another embodiment of the compound of formula II, the label is
biotin, a biotin derivative or a fluorophore.

In another embodiment of the compound of formula II, the fluorophore
is fluorescein, tetrachlorofluorescein, hexachlorofluorescein, Cy3,
tetramethylrhodamine, Cy3.5, carboxy-x-rhodamine, Texas Red, Cy5, Cy5.5,
phycoerythrins, or allophycocyanins.

In another embodiment of the compound of formula II, the label is

[Structure of the label: drawing not recoverable from the source text.]

Another embodiment encompasses a compound of formula III:

[Structure of formula III: drawing not recoverable from the source text.]

In one embodiment of the compound of formula III, X, Y, and Z are
independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

In another embodiment of the compound of formula III, the compound
is stereomerically pure.

Another embodiment encompasses a compound of formula IV:

[Structure of formula IV: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

Another embodiment encompasses a compound of formula V:

[Structure of formula V: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

Another embodiment encompasses a compound of formula VI:

[Structure of formula VI: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is H or metal.

In another embodiment the compound is stereomerically pure.

Another embodiment encompasses a compound of formula VII:

[Structure of formula VII: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is H or metal.

Another embodiment encompasses a compound of formula VIII:

[Structure of formula VIII: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is H or metal.

Another embodiment encompasses a compound of formula IX:

[Structure of formula IX: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is H or metal.

Another embodiment encompasses a compound of formula X:

[Structure of formula X: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is H or metal.

Another embodiment encompasses a compound of formula XI:

[Structure of formula XI: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is H or metal.

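Another embodiment encompasses a compound of formula XII:

[Structure of formula XII: drawing not recoverable from the source text. The
surviving fragments show a triphosphate group,
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-.]

wherein R3 is H or metal.

Another embodiment encompasses a compound of formula XIII:

[Structure of formula XIII: drawing not recoverable from the source text. The
surviving fragments show a triphosphate group,
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-.]

wherein R3 is H or metal.

Another embodiment encompasses a compound of formula XIV:

[Structure of formula XIV: drawing not recoverable from the source text. The
surviving fragments show a triphosphate group,
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-.]

wherein R3 is H or metal.

Another embodiment encompasses a compound of formula XV:

[Structure of formula XV: drawing not recoverable from the source text. The
surviving fragments show a triphosphate group,
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-.]

wherein R3 is H or metal.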

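Another embodiment encompasses a compound of formula XVI:

[Structure of formula XVI: drawing not recoverable from the source text. The
surviving fragments show a triphosphate group,
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-.]

wherein R3 is H or metal.

Another embodiment encompasses a compound of formula XVII:

[Structure of formula XVII: drawing not recoverable from the source text. The
surviving fragments show a triphosphate group,
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-.]

wherein R3 is H or metal.

Another embodiment encompasses a compound of formula XVIII:

[Structure of formula XVIII: drawing not recoverable from the source text. The
surviving fragments show a triphosphate group,
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-.]

wherein R3 is H or metal.

Another embodiment encompasses a compound of formula XIX:

[Structure of formula XIX: drawing not recoverable from the source text. The
surviving fragments are consistent with a triphosphate group bearing OR3
substituents.]

wherein R3 is H or metal.

Another embodiment encompasses a compound of formula XX:

[Structure of formula XX: drawing not recoverable from the source text. The
surviving fragments show a triphosphate group,
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-.]

wherein R3 is H or metal.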

Another embodiment encompasses an oligonucleotide comprising a
detectable label of formula (XXI):

[Structure of formula XXI: drawing not recoverable from the source text. The
surviving fragments show Base, Sugar, Q, R2 and an NH group joined to
(Linker)-Label.]

alone or in combination with a counterion thereof or a stable salt, solvate, clathrate or
mixture thereof, wherein:

sugar is a ribose, deoxyribose or a ribose sugar analog;

base is [structure not recoverable from the source text];

wherein sugar is covalently bonded to [position illegible in source] of the base;

wherein R1 is -NHC(O)NH2, -H, -NH2, -OH, -O(alkyl), alkyl, or CO2H;

wherein R2 is -H or alkyl;

linker is (CH2CH2O)n, (CH2O)n or (CH2)n;

wherein n is an integer from 1 to 30; and

label is a colorimetric, chemiluminescent, bioluminescent, a
fluorescent compound, or a non- or weakly fluorescent compound.

Another embodiment encompasses an oligonucleotide comprising a
detectable label of formula XXII:

[Structure of formula XXII: drawing not recoverable from the source text.]

alone or in combination with a counterion thereof or a stable salt, solvate, clathrate or
mixture thereof, wherein:

base is [structure not recoverable from the source text];

wherein sugar is covalently bonded to [position illegible in source] of the base;

wherein R1 is -NHC(O)NH2, -H, -NH2, -OH, -O(alkyl), alkyl, or CO2H;

wherein R2 is -H or alkyl;

each of X, Y, and Z is independently -H, -OH, -O-alkyl, -SH, -SR4,
-NHR4, -NR4R5,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal;

R4 is -H or alkyl;

R5 is -H or alkyl;

linker is (CH2CH2O)n, (CH2O)n or (CH2)n;

wherein n is an integer from 1 to 30;

Q is O, S, or NH; and

label is a colorimetric, chemiluminescent, bioluminescent, a
fluorescent compound, or a non- or weakly fluorescent compound.
In one embodiment at least one of X, Y, and Z is -OH.

In another embodiment each of X, Y, and Z is -OH.

In another embodiment at least one of Y and Z is -OH and X is

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

In another embodiment X is

R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

In another embodiment X is

R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

In another embodiment X is

R3O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

In another embodiment R1 is -NHC(O)NH2.

In another embodiment linker is (CH2CH2O)n, wherein n is an integer
from 1 to 10.

In another embodiment label is biotin, a biotin derivative or a
fluorophore.

In another embodiment the fluorophore is fluorescein,
tetrachlorofluorescein, hexachlorofluorescein, Cy3, tetramethylrhodamine, Cy3.5,
carboxy-x-rhodamine, Texas Red, Cy5, Cy5.5, phycoerythrins, or allophycocyanins.
In another embodiment the label is

[Structures of the label: two alternative drawings, joined by "or", not
recoverable from the source text.]

Another embodiment encompasses a detectable label of formula
XXIII:

[Structure of formula XXIII: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

In another embodiment the detectable label is stereomerically pure.

Another embodiment encompasses a detectable label of formula
XXIV:

[Structure of formula XXIV: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.



Another embodiment encompasses a detectable label of formula XXV:

[Structure of formula XXV: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

Another embodiment encompasses a detectable label of formula
XXVI:

[Structure of formula XXVI: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

In another embodiment the detectable label is stereomerically pure.

Another embodiment encompasses a detectable label of formula
XXVII:

[Structure of formula XXVII: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

Another embodiment encompasses a detectable label of formula
XXVIII:

[Structure of formula XXVIII: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

Another embodiment encompasses a detectable label of formula
XXIX:

[Structure of formula XXIX: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

In another embodiment the detectable label is stereomerically pure.

Another embodiment encompasses a detectable label of formula XXX:

[Structure of formula XXX: drawing not recoverable from the source text.]

wherein X, Y, and Z are independently H, OH,

R3O-P(=O)(OR3)-O-, R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-, or
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-,

wherein R3 is -H or metal.

Another embodiment encompasses a detectable label of formula 

XXXI: 



O Me O 




N 

< i 
N 



H 





XXXI 

10 wherein X, Y, and Z are independently H, OH, 



O 

11 



O O 
11 ^ n 



000 

tl II M 



R3O-P-O— , RaO-P-O-P-O— , or R3O-P-O-P-O-P-O— 
OR3 OR3 OR3 OR3 OR3 OR3 

wherein R3 is -H or metal. 

Another embodiment encompasses a detectable label of formula XXXII:

[Structure of formula XXXII, bearing the triphosphate group
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-]



wherein R3 is H or metal.

Another embodiment encompasses a detectable label of formula XXXIII:

[Structure of formula XXXIII, bearing the triphosphate group
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-]



wherein R3 is H or metal. 

Another embodiment encompasses a detectable label of formula 

XXXIV: 



[Structure of formula XXXIV, bearing the triphosphate group
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-]



wherein R3 is H or metal. 

Another embodiment encompasses a detectable label of formula 

XXXV: 



[Structure of formula XXXV, bearing the triphosphate group
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-]



wherein R3 is H or metal. 

Another embodiment encompasses a detectable label of formula XXXVI:

[Structure of formula XXXVI, bearing the triphosphate group
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-]



wherein R3 is H or metal. 

Another embodiment encompasses a detectable label of formula XXXVII:

[Structure of formula XXXVII, bearing the triphosphate group
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-]



wherein R3 is H or metal. 

Another embodiment encompasses a detectable label of formula XXXVIII:

[Structure of formula XXXVIII, bearing the triphosphate group
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-]

wherein R3 is H or metal. 

Another embodiment encompasses a detectable label of formula XXXIX:

[Structure of formula XXXIX, bearing the triphosphate group
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-]






wherein R3 is H or metal. 

Another embodiment encompasses a detectable label of formula XL:

[Structure of formula XL, bearing the triphosphate group
R3O-P(=O)(OR3)-O-P(=O)(OR3)-O-P(=O)(OR3)-O-]



wherein R3 is H or metal. 






5.1.2 DETECTABLE LABEL BASE 

Compounds of formula I described herein can be used as detectable 
labels. Imidazole compounds are in principle compatible as bases in the detectable
labels of formula I. Thus, the invention encompasses bases including, but not limited 
to, imidazoles and substituted imidazoles. Preferred bases include, but are not limited 
to: 

wherein the sugar of the nucleoside or nucleotide is covalently bonded 
to of the base; and 

wherein R1 is -NHC(O)NH2, -H, -NH2, -OH, -O(alkyl), alkyl, or
CO2H.

5.1.3 LINKING GROUPS OR LINKERS 

The invention comprises conjugating or linking a nucleoside,
nucleotide, or a nucleoside or nucleotide analog to compounds of formula I. The
compounds of formula I are conjugated via the linking group by a variety of means,
including ionic attraction or covalent attachment. Preferably, the compounds are
linked via covalent attachment. The linking groups or linkers are capable of reacting
with a "complementary functionality" of a reagent, e.g., a nucleoside or nucleotide or
nucleic acid, and forming a "linkage" that connects the compound to the label. The
linker can be any linker known to those of skill in the art. Preferred linking groups
include, but are not limited to, (CH2CH2O)n, (CH2O)n, or (CH2)n, wherein n is an
integer from about 1 to about 30, preferably from about 2 to about 20, and
more preferably from about 3 to about 10. Optionally, the linker is a
covalent bond (i.e., the label may be bonded to the base by a covalent bond).

5.1.4 LABELS

The label can be any detectable label known to those of skill in the art.
Labels that can be used with compounds of formula I include biotin, biotin derivatives
and any fluorophore. In certain embodiments, labels containing biotin or a
fluorescent reporter may be used. Fluorescent reporters may be selected from
xanthene dyes, including fluoresceins, and rhodamine dyes. Many suitable forms of
these compounds are commercially available having various substituents on their
xanthene rings that can be used as a site for bonding or as the bonding functionality
for attachment to a reagent, preferably an oligonucleotide. Another group of
fluorescent labels are the naphthylamines and their derivatives, having an amino
group in the alpha or beta position. Included among such naphthylamino compounds
are 1-dimethylaminonaphthyl-5-sulfonate, 1-anilino-8-naphthalene sulfonate and
2-p-toluidinyl-6-naphthalene sulfonate. Other labels include, but are not limited to,
coumarins, such as 3-phenyl-7-isocyanatocoumarin; acridines, such as
9-isothiocyanatoacridine and acridine orange; N-(p-(2-benzoxazolyl)phenyl)-maleimide;
cyanines; BODIPY® dyes (Molecular Probes, Eugene, OR);
benzoxadiazoles; stilbenes; pyrenes; and the like.

Preferably, labels are selected from biotin, fluorescein and rhodamine
dyes. More preferably the label is biotin. These labels and appropriate linking
methodologies for attachment to oligonucleotides are described elsewhere (Khanna
et al., U.S. Patent No. 4,439,356; Marshall, 1975, Histochemical J., 7:299-303;
Menchen et al., U.S. Patent No. 5,188,934; Menchen et al., European Patent
Application No. 87310256.0; Bergot et al., International Application
PCT/US90/05565; and Barone et al., 2001, Nucleosides, Nucleotides, and Nucleic
Acids, 20(4-7): 1141-1145).

In certain specific embodiments, the label is biotin, fluorescein,
5-carboxyfluorescein (FAM), 2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein (JOE),
rhodamine, 6-carboxyrhodamine (R6G), N,N,N',N'-tetramethyl-6-carboxyrhodamine
(TAMRA), 6-carboxy-X-rhodamine (ROX), 4-(4'-dimethylaminophenylazo)benzoic
acid (DABCYL), 5-(2'-aminoethyl)aminonaphthalene-1-sulfonic acid (EDANS),
tetrachlorofluorescein (TET), indodicarbocyanine 3 (Cy3), indodicarbocyanine 5
(Cy5), indodicarbocyanine 5.5 (Cy5.5), 3-(ε-carboxypentyl)-3'-ethyl-5,5'-
dimethyloxacarbocyanine (CyA), or 1H,5H,11H,15H-xantheno[2,3,4-ij:5,6,7-
i'j']diquinolizin-18-ium, 9-[2(or 4)-[[[6-[(2,5-dioxo-1-pyrrolidinyl)oxy]-6-
oxohexyl]amino]sulfonyl]-4(or 2)-sulfophenyl]-2,3,6,7,12,13,16,17-octahydro-, inner
salt (TR or Texas Red). Representative fluorophores include:

Fluorophore                        Emission Max

Fluorescein                        520 nm
Tetrachlorofluorescein (TET)       536 nm
Hexachlorofluorescein (HEX)        556 nm
Cy3                                570 nm
Tetramethylrhodamine (TAMRA)       580 nm
Cy3.5                              596 nm
Carboxy-X-rhodamine (ROX)          605 nm
Texas Red                          610 nm
Cy5                                667 nm
Cy5.5                              694 nm
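
As an illustration only, the emission maxima tabulated above can be held in a small lookup table and used to pick a set of fluorophores whose peaks are well separated. The sketch below is in Python; the function name `pick_separated` and the 40 nm threshold used in the usage note are hypothetical choices made for this example and are not taken from the specification.

```python
# Emission maxima (nm) transcribed from the table above.
EMISSION_MAX_NM = {
    "Fluorescein": 520,
    "TET": 536,
    "HEX": 556,
    "Cy3": 570,
    "TAMRA": 580,
    "Cy3.5": 596,
    "ROX": 605,
    "Texas Red": 610,
    "Cy5": 667,
    "Cy5.5": 694,
}


def pick_separated(emission_by_name, min_gap_nm):
    """Greedily pick fluorophores whose emission maxima differ by at least
    min_gap_nm, scanning from the shortest wavelength upward.

    min_gap_nm is a hypothetical threshold supplied by the caller; peak
    separation alone is only a rough proxy for spectral resolvability.
    """
    chosen = []
    last_peak = None
    for name, peak in sorted(emission_by_name.items(), key=lambda kv: kv[1]):
        if last_peak is None or peak - last_peak >= min_gap_nm:
            chosen.append(name)
            last_peak = peak
    return chosen
```

Under the assumed 40 nm threshold, `pick_separated(EMISSION_MAX_NM, 40)` selects Fluorescein, Cy3, Texas Red, and Cy5. A real selection would also need to account for emission bandwidths and the detection filters in use.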

5.1.5 SUGARS 

The sugars that are generally compatible with compounds of formula I
include ribose and ribose analogs of formula:

[Structure of ribose analog bearing RO, OR', and OR" substituents]

wherein R, R', and R" are independently H, metal, or a mono-, di-, or
triphosphate.

5.2 PURIFICATION OF DETECTABLY LABELED OLIGONUCLEOTIDES

Oligonucleotides containing a compound of formula I can be purified
by any method known to one of ordinary skill in the art. In one nonlimiting
embodiment, the oligonucleotides are purified by reverse phase HPLC. Briefly, the
sample containing the oligonucleotide is loaded on a Hamilton PRP-1 column (1.0 cm
x 25 cm) and eluted with a linear 5% to 50% acetonitrile gradient over 40 minutes.
HPLC fractions corresponding to the fluorescent labeled oligonucleotide species are
collected, pooled, and lyophilized. Following lyophilization, the oligonucleotide
samples can be dissolved in water and precipitated, for example with lithium
perchlorate, followed by centrifugation, e.g., at 10,000 g for 10 minutes. The resulting
pellet can then be washed with 10% aqueous acetone. This method yields "standard
purity" single reverse phase HPLC (RP-HPLC) oligonucleotides.
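
For readers programming a pump, the linear gradient described above can be written as a simple interpolation. The following Python sketch is illustrative only; the function name and its parameters are hypothetical, and the defaults are taken from the 5% to 50% acetonitrile, 40 minute gradient stated in the text.

```python
def percent_acetonitrile(t_min, start_pct=5.0, end_pct=50.0, duration_min=40.0):
    """Acetonitrile percentage at time t_min (minutes) for a linear gradient.

    Values are clamped to the start and end compositions outside the
    gradient window; what the pump does before or after the gradient is
    an assumption of this sketch, not something the text specifies.
    """
    if t_min <= 0:
        return start_pct
    if t_min >= duration_min:
        return end_pct
    return start_pct + (end_pct - start_pct) * t_min / duration_min
```

With the defaults, the composition at the midpoint of the run (20 minutes) is 27.5% acetonitrile. The same interpolation applies to any other linear gradient by changing the three parameters.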

The oligonucleotides optionally can be repurified by ion exchange
HPLC to yield "high purity" dual reverse phase plus ion exchange HPLC (IE+RP
HPLC) purified oligonucleotides. In one embodiment, the oligonucleotides are
repurified by loading them on a 5x10 Source column (Amersham Pharmacia Biotech,
Piscataway, NJ) and eluting with a linear 0% to 50% gradient of 1 M LiCl, 0.1 M
TRIS buffer pH 8.0. HPLC peaks corresponding to the oligonucleotide species are
collected, pooled, precipitated with lithium perchlorate, and lyophilized.

Compound and/or oligonucleotide identities can be verified after 
synthesis and purification by mass spectroscopy, for example using a Voyager DE 
BioSpectrometry workstation (PE BioSystems, Framingham, MA). 

5.3 DETECTION OF DETECTABLE LABELS

The detectable label can be detected by any means known to those of
skill in the art to be suitable for detection of the label without limitations. For
instance, certain modified nucleotides, oligonucleotides, and polynucleotides of the
invention may be detected by contacting the compounds with polypeptides (e.g.,
avidin or an antibody) which are capable of forming complexes under suitable
conditions, provided that the polypeptides include one or more moieties which can be
detected when the complex or complexes is or are formed, generally by means of
conventional detection techniques. Other detectable labels can be detected, for
example, by spectroscopic techniques.

5.3.1 AVIDIN

One polypeptide detector for a biotin-labeled detectable label of the
invention is avidin. If avidin is coupled to potentially demonstrable indicator
molecules (e.g., fluorescent dyes such as, but not limited to, fluorescein or
rhodamine), electron dense reagents (e.g., ferritin, hemocyanin, or colloidal gold), or
enzymes capable of depositing insoluble reaction products (e.g., peroxidase or
alkaline phosphatase), the presence, location and/or quantity of the probe can be
established.

Avidin-biotin techniques can be used in flow cytometry and light,
electron and fluorescence microscopy, as well as with solution based methods such as
enzyme linked immunosorbent assays (ELISAs). In addition, avidin-biotin and
antibody-hapten techniques are frequently combined for simultaneous, multicolor
detection of multiple targets in a complex solution, cell or tissue sample. Furthermore,
by judicious choice of detection reagents and sandwich protocols, these techniques
can be employed to amplify the signal from low-abundance analytes. For example,
the bridging method is a common immunohistochemical technique for signal
amplification and improved tissue penetration in which avidin or streptavidin serves
as a bridge between two biotinylated molecules.

Avidin has, unfortunately, one property that makes it less desirable as a
biotin indicator protein when used in conjunction with nucleic acids or chromatin
material. It has been reported that avidin binds tightly to condensed chromatin or to
subcellular fractions that contain large amounts of nucleic acid in a manner which is
independent of its biotin-binding property (Heggeness, 1978, Stain Technol., 52: 165;
Heggeness et al., 1977, J. Cell Biol., 73: 783; and Bayer et al., 1980, Methods of
Biochemical Analysis, 26: 1).

5.3.2 STREPTAVIDIN 

Another probe for biotin-containing nucleotides and derivatives is
streptavidin, an avidin-like protein synthesized by the soil organism Streptomyces
avidinii. Its preparation and purification are described in Hoffman et al., 1980, Proc.
Natl. Acad. Sci., 77: 4666, which is incorporated herein by reference. Streptavidin
has a much lower pI (5.0) than avidin, is non-glycosylated, and shows much lower
non-specific binding to DNA than avidin, and therefore offers potential advantages in
applications involving nucleic acid detection methodology.

The protein streptavidin is a potential alternative to anti-biotin IgG as a
vehicle to specifically direct a coupled visualization system (e.g., fluorescent probes
(see § 4.6.1.1. below) or histochemical reagents (see § 4.3.3. below)) to the site of the
hybridized biotin-containing polynucleotide. One of streptavidin's advantages over
anti-biotin IgG is that its affinity for biotin is Kassn = 10^15, whereas association
constants for haptene-IgG interactions are 10^7 to 10^10. The fast reaction rate and
extreme affinity mean that the time required to localize the biotinized probe will be
minutes with streptavidin versus hours with immunologic reagents.

The analysis of protein from tissue samples using a biotin-streptavidin
detection system is well known in the art, as exemplified by western blot detection
using the Bio-Rad Amplified Alkaline Phosphatase Immun-Blot Detection (Bio-Rad
Laboratories, Hercules, CA). This detection format uses a biotinylated secondary
antibody with the amplification of streptavidin. Biotinylated alkaline
phosphatase then completes the sandwich, and color developing reagent (e.g.,
5-bromo-4-chloro-indolyl phosphate and nitroblue tetrazolium) is added. This forms a
purple precipitate at any point of binding of the biotinylated alkaline phosphatase.

5.3.3 MONOSPECIFIC RABBIT IgG ANTI-BIOTIN IMMUNOGLOBULIN

A protein for biotin-like probe detection is monospecific rabbit IgG,
anti-biotin immunoglobulin. This compound may be prepared by immunizing rabbits
with bovine serum albumin-conjugated biotin, followed by purification by affinity
chromatography (Berger, 1979, Methods in Enzymology, 62: 319). The anti-biotin
antibodies have proven extremely useful in detecting specific polynucleotide
sequences on chromosomes by in situ hybridization since little, if any, non-specific
binding of the antibody to chromatin material occurs.

5.3.4 IMMUNOLOGICAL AND HISTOCHEMICAL METHODS

Immunological and histochemical methods for the detection of biotin
have shown that the basic approach is usable for a rapid method of gene mapping
by in situ hybridization and for non-radioactive procedures for detecting specific
nucleic acid sequences by blotting hybridization methods. Use may be made of this
technology in development of new clinical diagnostic procedures.

Using this approach, it is possible to determine the presence of a
specific deoxyribonucleic or ribonucleic acid molecule, particularly such a molecule
derived from a living organism (e.g., bacteria, fungus, virus, yeast, or mammal). This
in turn permits diagnosis of nucleic acid containing etiological agents in a patient or
other subject.

Moreover, it provides a method for screening bacteria to determine
antibiotic resistance. Thus, for example, penicillin resistance in Streptococcus
pyogenes or Neisseria meningitidis; tetracycline resistance in Staphylococcus aureus,
Candida albicans, Pseudomonas aeruginosa, Streptococcus pyogenes, or Neisseria
gonorrhoeae; and aminoglycoside resistance in Mycobacterium tuberculosis can be
determined.

In these methods a polynucleotide is prepared which is
complementary to the nucleic acid sequence that characterizes the organism or its
antibiotic resistance and which additionally includes one or more modified
nucleotides according to the invention. This polynucleotide is hybridized with a
nucleic acid obtained from the organism being studied. Failure to hybridize indicates
absence of the organism or of the resistance characteristic. Hybridized nucleic acid
duplexes are then identified by forming a complex between the duplex and a suitable
polypeptide, which carries a detectable moiety, and detecting the presence of the
complex using an appropriate detection technique. Positive detection indicates that
the complex, the duplex and therefore the nucleic acid sequence of interest are
present.

This approach can be extended to the diagnosis of genetic disorders,
including, but not limited to, thalassemia and sickle cell anemia. The
deoxyribonucleic acid gene sequence whose presence or absence (in the case of
thalassemia) is associated with the disorder can be detected following hybridization
with a polynucleotide probe according to this invention based upon complex
formation with a suitable detectable polypeptide.

The mapping of genes or their transcripts to specific loci on
chromosomes is a tedious and time consuming occupation, involving mainly
techniques of cell fusion and somatic cell genetics. Although in situ hybridization has
been employed successfully for mapping single copy gene sequences in species that
undergo chromosome polytenization, such as Drosophila, detection of unique
sequence genes in most higher eukaryotic chromosomes has been difficult using
standard hybridization methods. Since recombinant DNA technology has made feasible
the molecular cloning of virtually every single-copy sequence found in eukaryotic
cells, it would be extremely beneficial to have a rapid and sensitive method for
mapping the chromosomal origin of such cloned genomic fragments.

Modified nucleotides may be used in a method of gene mapping by
in situ hybridization, which circumvents the use of radioisotopes. This procedure
takes advantage of a thymidine analog containing biotin that can be incorporated
enzymatically into DNA probes by nick translation. After hybridization in situ the
biotin molecules serve as targets for affinity purified anti-biotin antibodies.

It has been found that polytene chromosomes could be used as a test
system for establishing the efficacy of probes using the modified nucleotides
according to the invention as detected by indirect immunofluorescence for in situ gene
mapping. Many of these clones have already been assigned to specific bands on the
Drosophila chromosome map by conventional in situ hybridization methods
employing radioisotopes.

It has also been shown that this immunological method works
with mammalian chromosomes wherein satellite DNA has been mapped to the
centromeric regions of mouse metaphase chromosomes. The result provides a basic
foundation for the development of a simple gene mapping procedure for single copy
(unique) sequences in chromosomes from human and other mammals. Such a
procedure should greatly facilitate our understanding of the genetic organization of
the chromosome and make clinical cytogenetic diagnosis much more rapid and
practical.

While a single step "antibody sandwich" method in which the
chromosome spread is challenged, post hybridization, with rabbit anti-biotin IgG may
succeed, this protocol may not generate sufficient fluorescence for unambiguous gene
assignments. However, a much stronger fluorometric signal can be achieved by using
the "haptene-antibody sandwich technique" described by Lamm et al., (1972); and
Wofsy, et al., (1974). In this procedure the primary antibody (e.g., monospecific,
rabbit anti-biotin IgG) is chemically modified with a haptenization reagent, such as
2,4-dinitrofluorobenzene, preferably while the immunoglobulin is bound to an antigen
affinity column (biotin Sepharose™). As many as 15-20 haptene (DNP) groups can
be coupled to the primary antibody without decreasing its antigen binding affinity or
specificity (see, e.g., Wallace and Wofsy, 1979). If the primary antibody treatment of
the test sample is followed by an incubation with a fluorescently labeled anti-hapten
IgG antibody, rather than a fluorescently labeled anti-IgG, a 5-7 fold increase in
fluorescence signal can be achieved. Since one also has available monospecific guinea
pig anti-DNP IgG, the secondary antibody can be haptenized with biotin and thus
generate two anti-hapten IgG populations, DNP-labeled anti-biotin IgG and
biotin-labeled anti-DNP IgG. These can be used alternately to achieve several rounds of
hapten-antibody sandwiching and then followed with fluorescently labeled protein A
from Staphylococcus aureus, which binds specifically to IgG molecules from many
mammalian species, resulting in an enormous amplification of the primary antibody
signal with its concomitant utility.

Polytene chromosomes hybridized with biotinized DNA probes can be
incubated with streptavidin followed by a subsequent incubation with bovine serum
albumin which has been doubly labeled with biotin and FITC (FITC, biotinyl-BSA).
Since only one of the four streptavidin subunits is likely to be involved in binding at
each biotinized DNA site, potentially one labeled BSA molecule can bind to each of
the remaining three nonconjugated subunits of the streptavidin-biotinyl nucleotide
complex. The fluorescence signal from this single streptavidin+FITC, biotinyl-BSA
layer will be compared with a control using the basic "antibody sandwich method."

If the "antibody sandwich" and streptavidin+FITC, biotinyl-BSA
detection intensities are comparable, one can attempt to enhance the
streptavidin+FITC, biotinyl-BSA system to single copy sensitivity in a manner
that parallels the multiple "haptene-antibody sandwich" approach. Since some of the
biotin groups on BSA will not be bound to the first layer of streptavidin, a second
layer of streptavidin can be added until sufficient signal is obtained. For example, if
in the second layer, only two streptavidin protomers bind to each first layer BSA and
each of these streptavidin protomers binds three FITC-biotinyl BSA molecules, then
the second layer intensity will be twice as great as that from the first layer; for the
third layer, with analogous binding stoichiometries, the fluorescent intensity will be
12-fold that of the first layer, so the total intensity will rapidly increase. With
successively added layers, a larger carrier protein such as thyroglobulin rather than BSA
can be used in order to maximize amounts of attached fluorescent and biotin probes.
It may also be necessary to use a longer linker arm between the biotin probe and the
carrier protein. A longer linker arm should sterically optimize the theoretical delivery
of a biotinized fluorescent carrier molecule to each nonconjugated streptavidin
subunit and maximize the number of streptavidin protomers in the subsequent layer
which will bind to the biotinized fluorescent carrier. Appropriate controls are done to
ensure that substitution of the carrier protein with fluorescent probes and biotin does
not cause solubility and/or nonspecific binding problems.

The streptavidin-carrier protein delivery system has two significant
advantages over the immunofluorescent approach in addition to its speed of delivery.
First, only two protein components are needed to form the layers. Second, only the
carrier protein needs to be modified and it is not necessary to maintain functional or
even total structural integrity as long as the biotin groups are accessible to
streptavidin.

An alternative to the fluorescence method for visualizing hybridized
probes is to direct enzymes such as peroxidase, alkaline phosphatase, or
β-galactosidase to the hybridization site, where enzymatic conversion of soluble
substrates to insoluble colored precipitates permits light microscope visualization.
The important advantage of this technique is that the histochemical methods are 10 to
100-fold more sensitive than fluorescence detection. In addition, the colored
precipitates do not bleach with extensive light exposure, thus avoiding one of the
general disadvantages of fluorescent light microscopy. These enzymes can be
coupled to the final antibody instead of fluorescent probes in the "haptene-antibody
sandwich" technique using bifunctional reagents such as glutaraldehyde or, in the case
of peroxidase, via oxidation of the peroxidase carbohydrate moieties to aldehydes and
coupling of these residues with ε-amino groups of the desired protein. For the
streptavidin-biotinized carrier protein method, an enzyme with biotinyl groups
coupled to it could replace a fluorescently biotinized carrier system. Alternately, the
enzyme could be coupled via biotin to the last layer of streptavidin with amplification
of streptavidin sites being built up in preceding layers using biotinized BSA or
thyroglobulin.

Detecting and/or imaging very low levels of fluorescent light is
possible using currently available image intensifiers or systems composed of lasers
and photomultipliers. These methods permit the detection of light down to the level
of individual photons. With suitable digital processing systems, images can be
produced in which each point (i.e., each pixel) of the image is strictly proportional to
the number of photons emitted by a point at the object. Using systems of this kind or
flow systems in which the cells or parts of cells flow past a laser beam, one can obtain
detection sensitivity increases for fluorescent material of factors between 100 and
1000 beyond that which can be detected by the eye. This increase is sufficient to
detect the fluorescence of single copy genes.



5.3.5 OTHER DETECTION TECHNIQUES 

Detectable labels of formula I can also be conjugated directly to signal
generating compounds, for example, by conjugation with an enzyme or fluorophore.
Illustrative enzymes of the invention include, but are not limited to, hydrolases,
particularly phosphatases, esterases and glycosidases, oxidoreductases, and
peroxidases. Illustrative fluorescent compounds of the invention include, but are not
limited to, those disclosed in section 4.1.4 above and also include rhodamine and its
derivatives, dansyl, and umbelliferone. Illustrative chemiluminescent compounds of
the invention include, but are not limited to, luciferin and
2,3-dihydrophthalazinediones, for example, luminol.

Means of detecting labels are well known to those of skill in the art.
For example, where the label is a fluorescent label, it can be detected by exciting the
fluorochrome with the appropriate wavelength of light and detecting the resulting
fluorescence, for example, by microscopy, visual inspection, via photographic film,
by the use of electronic detectors such as charge coupled devices or photomultipliers
and the like. Similarly, enzymatic labels can be detected by providing appropriate
substrates for the enzyme and detecting the resulting reaction product. Finally, simple
colorimetric labels are often detected simply by observing the color associated with
the label. Thus, in various dipstick assays, conjugated gold often appears pink.

5.4 PHOSPHORAMIDITE SYNTHESIS 

Another preferred class of reagents of the invention are
phosphoramidite compounds that incorporate the detectable labels of the invention.
Phosphoramidite reagents have the general structure:

[General structure of phosphoramidite reagents]

and are capable of forming a covalent bond with a hydroxyl group of a sugar (e.g., the
3' or 5' hydroxyl group) of a detectable label of the invention. Accordingly, the
invention provides methods of making phosphoramidite compounds from any of the
compounds of formula I-XL, which comprise a phosphoramidite reagent at the 5'-OH
or the 3'-OH. Such phosphoramidite reagents are particularly useful for the
automated chemical synthesis of nucleic acids labeled with the detectable labels of the
invention. Such phosphoramidite reagents, when reacted with a hydroxyl group, such
as a 5'-hydroxyl group of a nucleoside or nucleotide or nucleic acid, form a phosphite
ester linkage which, in turn, is oxidized to yield a phosphate ester linkage. For a
detailed discussion of phosphoramidite chemistry see, e.g., Caruthers et al., U.S.
Patent Nos. 4,458,066 and 4,415,732 and Gait, 1985, Oligonucleotide Synthesis: A
Practical Approach, IRL Press, Oxford, England, each of which is incorporated
herein by reference. The phosphoramidite reagents can be nucleosidic or
non-nucleosidic.

5.5 DETECTABLE LABEL-CONTAINING OLIGONUCLEOTIDES 

The invention further provides oligonucleotides comprising the
detectable labels of the invention. One or more of the nucleotides in the
oligonucleotides can be a ribonucleotide or a nucleotide analog. An oligonucleotide
of the invention can be of any length, but is preferably at least 8 nucleotides long. In
other embodiments, an oligonucleotide of the invention is 10, 12, 15, 18, 20 or more
nucleotides in length. In certain embodiments, the oligonucleotide is less than 50, less
than 40, or less than 30 nucleotides in length.

Oligonucleotides of the invention can typically form complementary
base pairs and provide an -OH group at the 3' terminus that serves as the origin of
synthesis of the complementary chain. Accordingly, the backbone is not necessarily
limited to one formed by phosphodiester linkages. For example, it may be composed of a
phosphorothioate derivative having S in place of O as a backbone or a peptide nucleic
acid based on peptide linkages.

In general, detectably labeled oligonucleotides can be synthesized with
a variety of different solid supports including, but not limited to, controlled pore glass
or polystyrene.

5.6 METHODS UTILIZING DETECTABLE LABELS 

Several hybridization assay formats that employ detectable labels as a
means for detecting hybridization have been described, some of which are described
below. These assays are useful for, e.g., detecting the presence of a specific
nucleotide sequence in a sample, detecting the presence of contiguous sequences on a
target nucleic acid, detecting the presence of mutations within a target nucleic acid
sequence, monitoring the kinetics of nucleic acid hybridization, and monitoring the
progression of PCR reactions. It will be appreciated that the hybridizing sequences
need not have perfect complementarity to provide stable hybrids. In many situations,
stable hybrids will form where fewer than about 10% of the bases are mismatches,
ignoring loops of four or more nucleotides. Accordingly, as used herein the term
"complementary," unless otherwise indicated, refers to an oligonucleotide that forms
a stable duplex with its "complement" under assay conditions, generally where there
is about 90% or greater homology.
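
The mismatch criterion stated above can be illustrated with a short calculation. The Python sketch below is a simplification made for this example: it compares a DNA probe against an equal-length target site with no gaps, so it does not model the "loops of four or more nucleotides" that the text says are ignored, and it treats "about 10%" as a hard threshold. The function names and the `max_mismatch` parameter are hypothetical.

```python
# Watson-Crick complements for DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def mismatch_fraction(probe, target_site):
    """Fraction of probe positions that do not pair with the target site.

    Both sequences are written 5'->3' and must have the same length;
    gaps and loops are not modelled in this sketch.
    """
    if len(probe) != len(target_site):
        raise ValueError("probe and target site must have equal length")
    expected = reverse_complement(target_site)
    mismatches = sum(1 for a, b in zip(probe.upper(), expected) if a != b)
    return mismatches / len(probe)


def is_complementary(probe, target_site, max_mismatch=0.10):
    """True when fewer than max_mismatch of the bases are mismatches."""
    return mismatch_fraction(probe, target_site) < max_mismatch
```

For a 20-nucleotide probe, one mismatch (5%) passes this check while two mismatches (10%) do not. Whether a given duplex is actually stable depends on the assay conditions, which this calculation does not capture.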

5.6.1 DETECTION OF TARGET NUCLEIC ACIDS

Described herein are exemplary embodiments that employ detectable 
labels of the invention for detection of nucleic acid hybridization events. 

While the assay formats described herein are represented in terms of
systems employing only a single detectable label, multi-reporter systems may also be
practiced. Such multi-reporter systems are advantageous in applications requiring the
analysis of multiple hybridization events in a single reaction volume. In such
systems, each of the reporter molecules produces an emission that is spectrally
resolvable among the emissions of the other reporters.

5.6.1.1 PROBE METHODS 

In one assay format, a single oligonucleotide probe is used. The probe
comprises a detectable label of the invention. The detectable label can be attached to
the 3' end of the probe or the 5' end, or can be at any nucleotide within the probe
sequence.

The basic principles of oligonucleotide probe design and synthesis are
well known. Those basic principles apply generally to the design and synthesis of
oligonucleotide probes used in this invention.

The invention includes an essentially unlimited number of different
oligonucleotide probes. For example, an oligonucleotide probe used according to this
invention can vary considerably in length. The preferred length will depend on
considerations such as the target nucleic acid length, length of the nucleotide
sequence of interest, target nucleic acid type (i.e., DNA or RNA), target nucleic acid
G-C content, and the spacing of the detectable label on the probe. Preferably, the
length of the probe used in practicing this invention is from 20 to 200 nucleotides.
More preferably, the length is from 40 to 100 nucleotides.

Regardless of probe length, the probe can vary in nucleotide sequence. 
The nucleotide sequence of a probe used in this invention will depend on the sequence 
of the target nucleic acid. The probe's nucleotide sequence should have sufficient 
complementarity to the target nucleic acid to allow hybridization with the target 
nucleic acid under the chosen in situ hybridization conditions. Preferably, base pair 
matching between the probe and target nucleic acid is at least 80%. More preferably, 
the base pair matching is approximately 100%. 
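
The base pair matching criterion can be illustrated with a short calculation. The following is a minimal sketch only, assuming a DNA probe and a target site of equal length with no loops or gaps; the function names and the 80% default are illustrative choices and are not part of the described methods.

```python
# Watson-Crick partners for DNA bases (an RNA target would use U in place of T).
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def percent_base_pair_matching(probe: str, target_site: str) -> float:
    """Return the percentage of probe bases that pair with the target site.

    Both sequences are written 5'->3'. The probe binds antiparallel, so the
    target site is reversed before the position-by-position comparison.
    Assumes equal lengths and no loops or gaps (a simplification).
    """
    probe = probe.upper()
    target_site = target_site.upper()
    if not probe:
        raise ValueError("sequences must not be empty")
    if len(probe) != len(target_site):
        raise ValueError("probe and target site must be the same length")
    paired = sum(
        COMPLEMENT.get(p) == t for p, t in zip(probe, reversed(target_site))
    )
    return 100.0 * paired / len(probe)


def meets_threshold(probe: str, target_site: str, minimum: float = 80.0) -> bool:
    """True if base pair matching is at least `minimum` percent."""
    return percent_base_pair_matching(probe, target_site) >= minimum
```

For example, a 10-base probe that mispairs at two positions has 80% base pair matching and just satisfies the preferred minimum, whereas three mispaired positions (70%) do not.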
The total number and the spacing of detectable labels in the 
oligonucleotide can vary. Preferably, a detectable label is incorporated within five 
bases from the 3' end of the oligonucleotide. 

5.6.1.2 5' → 3' EXONUCLEASE ASSAY 

In yet another assay format, referred to herein as the 5' → 3' 
exonuclease assay, a doubly labeled probe including both a reporter label such as, for 
example, a fluorophore, and a quencher label, such as, but not limited to, 
tetramethylrhodamine, Black Hole Quenchers (BioSearch Technologies, Novato, 
CA), DABCYL, or QSY-4, is digested upon hybridization to a target nucleic acid, 
thereby liberating one or both of the labels from the probe (Lee et al., 1993, USA 
88:7276-7280; U.S. Patent Nos. 5,538,848 and 5,210,015, which are incorporated 
herein by reference). In the methods of the invention, either the fluorophore or the 
quencher or both can be incorporated into an oligonucleotide using a detectable label 
of the invention. In this method, the doubly labeled probe is hybridized to a target 
nucleic acid. In addition, an oligonucleotide primer is hybridized to the target nucleic 
acid at a position upstream from the probe, i.e., in the 5' direction relative to the probe 
and closer to the 3'-end of the target nucleic acid. The primer is then extended using a 
polymerase enzyme having 5' → 3' exonuclease activity, e.g., a DNA polymerase, 
thereby forming an extended primer. During the primer extension reaction, the 5' → 3' 
exonuclease activity of the polymerase serves to digest the probe so as to form a first 
probe fragment including the reporter label and a second probe fragment including 
the quencher label. Thus, the reporter and quencher labels are separated, and the 
emission of the reporter becomes unquenched. This process can be performed using 






TaqMan® kits and reagents available from Roche Molecular Diagnostics, Pleasanton, 
CA. 

5.6.1.3 REAL-TIME PCR 

The assays described herein may be conducted in combination with a 
nucleic acid amplification step, e.g., PCR. That is, prior to conducting the 
hybridization assay, all or part of the nucleic acid sample may be amplified. When 
performed in combination with an amplification step, the hybridization assay may be 
conducted in an end-point mode or a real-time mode. In an end-point mode, the 
hybridization assay is performed after the amplification reaction is complete, e.g., 
after all or substantially all of the amplification cycles of a PCR reaction have been 
completed. In a real-time mode, a hybridization assay is performed multiple times 
during the amplification reaction, for example, after each thermocycle of a PCR 
process (Higuchi, U.S. Patent Nos. 5,994,056 and 6,171,785, each of which is 
incorporated herein by reference). The real-time mode is preferred when a 
quantitative measure of the initial amount of target nucleic acid is required, e.g., 
where the copy number of pathogen nucleic acid present in a blood sample is to be 
determined. 

5.6.1.4 DETECTION OF RIBONUCLEASE ACTIVITY 

Detectable label-containing nucleic acids can also be used to detect the 
presence and/or amount of RNase activity in a sample. 

Briefly, ribonuclease activity is detected by incubating a synthetic 
oligonucleotide that serves as the RNase substrate with the sample, for a time 
sufficient for cleavage of the substrate by a ribonuclease enzyme. The substrate 
comprises a single stranded nucleic acid molecule containing at least one 
ribonucleotide residue at an internal position that functions as a cleavage site, a 
reporter dye on one side of the cleavage site, and a quencher dye on the other side of 
the cleavage site. Upon cleavage at the internal ribonucleotide residue, the reporter 
dye, whose emission was quenched by the quencher dye, becomes detectable. Thus, 
detection of a fluorescence signal indicates that a ribonuclease cleavage event has 
occurred, and, therefore, the sample contains ribonuclease activity. 

In one embodiment, the RNase detection methods of the invention are 
conducted in conjunction with one or more positive controls. In one embodiment, the 
positive control is a quantitative positive control. 

5.6.1.5 DETECTION OF ENDONUCLEASE ACTIVITY 

Detectable label-containing nucleic acids can also be used to detect the 
presence and/or amount of endonuclease activity in a sample. 
Briefly, endonuclease activity is detected by incubating a labeled 
oligonucleotide with the sample, for a time sufficient for cleavage of the substrate by 
an endonuclease enzyme, or other molecule having endonucleolytic activity. The 
substrate comprises a single stranded nucleic acid molecule containing at least one 
deoxyribonucleotide residue at an internal position that functions as a cleavage site, a 
reporter dye on one side of the cleavage site, and a quencher dye on the other side of 
the cleavage site. Upon cleavage at the internal deoxyribonucleotide residue, the reporter 
dye, whose emission was quenched by the quencher dye, becomes detectable. Thus, 
detection of a fluorescence signal indicates that an endonucleolytic cleavage event has 
occurred, and, therefore, the sample contains endonuclease activity. 

In one embodiment, the endonuclease detection methods of the 
invention are conducted in conjunction with one or more positive controls. In one 
embodiment, the positive control is a quantitative positive control. 

5.6.1.6 DETECTION OF TdT ACTIVITY 

Detectable label-containing nucleic acids can also be used to detect the 
presence and/or amount of terminal deoxynucleotidyl transferase (TdT) activity in a 
sample. 

Briefly, TdT activity is detected by incubating an unlabeled oligo- or 
polynucleotide with the sample in the presence of a nucleotide triphosphate bearing a 
label, for a time sufficient for addition to the substrate of one or more labeled 
nucleotide triphosphates by TdT enzyme, or other molecule having TdT activity. The 
substrate comprises a single- or double-stranded nucleic acid molecule containing at 
least one free 3' OH group. The addition of the label to the substrate can be detected 
upon separation of the oligo- or polynucleotide from the labeled nucleotide 
triphosphate. 

In one embodiment, the TdT detection methods of the invention are 
conducted in conjunction with one or more positive controls. In one embodiment, the 
positive control is a quantitative positive control. In one embodiment, the oligo- or 
polynucleotide is labeled at an internal site with a quencher, and the labeled nucleotide 
triphosphate bears a fluorescent molecule as a label. In another embodiment, the 
oligo- or polynucleotide is labeled at an internal site with a fluorophore, and the 
labeled nucleotide triphosphate bears a quencher molecule as a label. In either of the 
above two embodiments, addition of the labeled nucleotide triphosphate enables 
real-time detection of the TdT activity. Such an assay requires that the label on the oligo- 
or polynucleotide be positioned such that fluorescence resonance energy transfer 
(FRET) can occur between the quencher and the fluorophore. 

5.6.1.7 CHROMOSOMAL KARYOTYPING 

The invention also provides a method of chromosomal karyotyping. In 
this method, modified polynucleotides are prepared which correspond to known genes 
and include modified nucleotides. These polynucleotides are hybridized with 
chromosomal deoxyribonucleic acid and the resulting duplexes contacted with 
appropriate polypeptides under suitable conditions to permit complex formation. The 
polypeptides include detectable moieties so that the location of the complexes can be 
determined and the location of specific genes thereby fixed. 

5.6.1.8 DETECTION OF TUMOR CELLS 

Tumor cells can be diagnosed by preparing polynucleotides that are 
modified according to this invention and are complementary to the messenger 
ribonucleic acid synthesized from a deoxyribonucleic acid gene sequence associated 
with the production of polypeptides, such as α-fetal protein or carcinoembryonic 
antigen, the presence of which is diagnostic for specific tumor cells. Hybridization 
and detection of hybrid duplexes thus would provide a method for detecting the tumor 
cells. 

5.7 KITS 

The present invention further provides kits comprising the 
compositions of the invention. Such kits can be useful, inter alia, for practicing the 
methods described herein, or to provide materials for synthesis of the compositions 
described herein. Generally, a kit of the invention comprises in one or more 
containers a detectable label of the invention or an oligonucleotide that incorporates a 
detectable label of the invention. Additional components can be included in the kit, 
depending on the particular application that utilizes the compounds of the invention. 

For example, where the kit is directed to the real-time PCR assays 
described in Section 5.6.1.3, the kit may further comprise a DNA polymerase. Where 
a kit of the invention is intended for the practice of the RNase detection assays of 
Section 5.6.1.4, the kit may further comprise, for example, RNase-free water. Where 
the kit is directed to end-labeling a double- or single-stranded DNA molecule, the kit 
can further comprise a terminal deoxynucleotidyl transferase. Where the kit is 
directed to assaying an enzyme activity, such as endonuclease activity or terminal 
deoxynucleotidyl transferase activity, the kit can further contain a standard amount of 
endonuclease or terminal deoxynucleotidyl transferase, respectively, of a known 
specific activity. 

Such kits may optionally contain one or more of: a negative and/or 
positive control, a buffer, and instructions for use of the kit's reagents. Where the kit 
is intended for diagnostic applications, the kits may further include a label indicating 
regulatory approval for the diagnostic application. 

5.8 GENERAL SYNTHESIS OF DETECTABLE LABELS 

The invention will be further clarified by a consideration of the 
following illustrative examples, which are intended to be purely exemplary of the 
invention and not to in any way limit its scope. 

The compounds of the invention can be obtained via the synthetic 
methodology illustrated in Schemes 1-5. Starting materials useful for preparing the 
compounds of the invention and intermediates thereof are commercially available or 
can be prepared from commercially available materials using known synthetic 
methods and reagents. 

5.8.1 SCHEME 1: SYNTHESIS OF LINKER LABEL COMPOUND 

Scheme 1 (structures not reproduced; compounds 1, 2 and 3; Q is NH2) 

Scheme 1 outlines a method useful for synthesizing compounds 
comprising a linker and a label, wherein the linker is, for example, but not limited to, 
polyethylene glycol or polymethylene and the label is a colorimetric, chemiluminescent, 
bioluminescent, or fluorescent emitting compound. 

First, the carboxyl group of compound 2 is substituted with an 
N-hydroxysuccinimide (NHS) group to facilitate amidation and peptide coupling. 
However, other suitable coupling groups are well known in the art, for example, but 
not limited to, O-(benzotriazol-1-yl)-N,N,N',N'-tetramethyluronium 
hexafluorophosphate (HBTU), dicyclohexylcarbodiimide (DCC), or 
O-(7-azabenzotriazol-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate (HATU). 
Diamines 1, wherein, for example, the linker is tetraethyleneglycol or 
tetramethyleneglycol, are available commercially (e.g., Aldrich Chemical Co., 
Milwaukee, Wisconsin). In addition, Q can be SH or OH, in which case the amine can 
optionally be protected. Organic solvents suitable for the conversion of compound 1 to 
compound 3 include, but are not limited to, dichloromethane, diethyl ether, 
tetrahydrofuran, benzene, toluene, xylene, hydrocarbon solvents (e.g., pentane, hexane, 
and heptane), and mixtures thereof. After addition of compound 1 to compound 2, the 
reaction mixture is allowed to stir (e.g., for about 2 days) until the reaction is 
substantially complete as determined by using an appropriate analytical method, 
preferably thin layer chromatography or high performance liquid chromatography. 
Then the reaction mixture is quenched by methods known in the art, and compound 3 
can be isolated by work up and purified if desired by methods known in the art, for 
example, column chromatography or preparative HPLC. 

5.8.2 SCHEME 2: SYNTHESIS OF PROTECTED α-SUBSTITUTED-LINKER-LABEL COMPOUND 

Scheme 2 (structures not reproduced) 



Scheme 2 illustrates the coupling of the linker label compound 3 with 
the R2-substituted compound 4 to form the α-substituted-linker-label compound 5, 
wherein R2 is described herein and -Pg is a protecting group. 

Preferably, to synthesize compound 5, a solution of about 1 equivalent 
of 4 and 1.5 equivalents of 3 is dissolved in a suitable organic solvent under an inert 
atmosphere. The use of polar aprotic solvents (e.g., dimethylformamide, acetonitrile, 
or dimethyl sulfoxide) is preferred. The solution is maintained at a constant 
temperature within the range of about 15° C to about room temperature, preferably at 
room temperature, and base is added, preferably an amine base, such as, but not 
limited to, diisopropylethylamine. Preferably, about 10 equivalents of the substituted 
amine is added in one single addition. After addition of the amine base, the reaction 
mixture is allowed to stir for about 1 to about 4 hours at room temperature, preferably 
at a rate such that the reaction mixture temperature remains constant. The reaction 
mixture is allowed to stir at this temperature until the reaction is substantially 
complete as determined by using an appropriate analytical method, preferably 
thin-layer chromatography or high performance liquid chromatography. The reaction 
mixture is then quenched to afford a mixture containing compound 5. Compound 5 
can be purified by techniques known in the art (e.g., preparative HPLC or flash 
chromatography). However, the crude preparation containing compound 5 could also 
be used in the next step without further purification. 

5.8.3 SCHEME 3: DEPROTECTION OF α-SUBSTITUTED-LINKER-LABELED COMPOUND 

Scheme 3 (structures not reproduced; compound 5 is treated with a deprotecting 
agent to give compound 6; Q is NH) 



Scheme 3 outlines methods useful for deprotecting the substituted 
linker labeled compound 5. A typical procedure for deprotection involves dissolving 
compound 5 in a polar organic solvent. Organic solvents suitable for the conversion 
of compound 5 to compound 6 include, but are not limited to, methanol, 
dimethylformamide, acetonitrile, and dimethyl sulfoxide. After compound 5 is 
dissolved, a deprotecting agent is added. Suitable deprotecting agents will remove the 
protecting group without reacting with any other moiety in the compound. Examples 
of suitable deprotecting reagents include, but are not limited to, piperidine, 
morpholine, and DBU. The reaction mixture is allowed to stir at room temperature 
until the reaction is substantially complete (e.g., about 12 hours) as determined by 
using an appropriate analytical method, preferably ninhydrin, thin layer 
chromatography, or high performance liquid chromatography. Then the reaction 
mixture is quenched and compound 6 can be isolated by work up and purified if 
desired by methods known in the art, for example, column chromatography or 
preparative HPLC. 

5.8.4 SCHEME 4: SYNTHESIS OF OXANOSINE COMPOUND 

Scheme 4 (structures not reproduced; R, R', and R" are independently H, a 
protecting group, or mono-, di-, or triphosphate) 

Purine sugar compounds of formula 7 can be converted to oxanosine 
sugar compounds of formula 8 by dissolving compound 7 in a buffer (e.g., sodium 
acetate) of pH 3-4 and sterile water at room temperature. Sodium nitrite is then added 
and the solution is stirred at room temperature. The reaction mixture is allowed to stir 
at room temperature until the reaction is substantially complete (e.g., about 24 hours) 
as determined by using an appropriate analytical method, preferably high performance 
liquid chromatography. The reaction mixture is then quenched and compound 8 can 
be isolated by work up and purified if desired by methods known in the art, for 
example, column chromatography or preparative HPLC. The compound can be 
further purified if desired by, for example, lyophilization with water. 

5.8.5 SCHEME 5: SYNTHESIS OF UREA-SUBSTITUTED IMIDAZOLE AMIDE DETECTABLE LABEL 

Scheme 5 (structures not reproduced; compound 9) 

The detectable label compound 9 can be synthesized by contacting 
oxanosine compound 8 in a buffer (e.g., sodium borate) with substituted linker label 
compound 6 in sterile water. The reaction mixture is kept in an incubator with gentle 
stirring at 34° C to 40° C until the reaction is substantially complete (e.g., about 24 
hours) as determined by using an appropriate analytical method, preferably high 
performance liquid chromatography. The reaction mixture is then quenched and 
compound 9 can be isolated by work up and purified if desired by methods known in 
the art, for example, column chromatography or preparative HPLC. The compound 
can be further purified if desired by, for example, lyophilization with water. 

5.8.6 SCHEME 6: SYNTHESIS OF UNSUBSTITUTED IMIDAZOLE-AMINE DETECTABLE LABELS 

Scheme 6 (structures not reproduced; compound 12) 

The detectable label 12 can be synthesized by first glycosylating a 
halogen substituted imidazole 10 to yield the glycosylated base 11 using methods 
known in the art, for example, 1995, Nucl. Acids Res., 23(4): 647-653 and 1996, 
Tetrahedron Asymmetry, 7(12): 3455-3464, each of which is incorporated herein by 
reference. Glycosylated base 11 is then contacted with substituted-linker-label 
compound 6 under conditions suitable for the formation of detectable label 12. See, 
e.g., 2000, J. Heterocyclic Chemistry, 37(1): 119-126; 1987, Australian J. Chemistry, 
40(8): 1399-1413; 1977, Khim. Farm. Zh., 11(10): 38-42; 2000, Chemistry of 
Heterocyclic Compounds, 36(2): 182-184; 1982, Indian J. Chemistry, Sect. B, 
21B(10): 928-640, each of which is incorporated herein by reference. Detectable 
label 12 can be purified if desired by methods known in the art, for example, column 
chromatography or preparative HPLC. The compound can be further purified if 
desired by, for example, lyophilization with water. 

5.9 SCHEME 7: SYNTHESIS OF UNSUBSTITUTED IMIDAZOLE AMIDE DETECTABLE LABELS 

Scheme 7 (structures not reproduced; compound 10, R = methyl, ethyl; compound 
11(a), R = methyl, ethyl; compound 11(b), R = H; compound 6, Q = NH; compound 13) 

The detectable label 13 can be synthesized by first glycosylating a 
carboxy-functionalized imidazole base 10 to yield the glycosylated base 11(a) using 
methods known in the art, for example, 1995, Nucl. Acids Res., 23(4): 647-653; 1995, 
Bioorg. Med. Chem. Lett., 5(15): 1679-1684; and 1996, Tetrahedron Asymmetry, 
7(12): 3455-3464; each of which is incorporated herein by reference. Glycosylated 
base 11(a) can be converted to 11(b) and then reacted with label compound 6 under 
conditions suitable for the formation of detectable label 13. See, e.g., 2000, J. 
Heterocyclic Chemistry, 37(1): 119-126; 1987, Australian J. Chemistry, 40(8): 
1399-1413; 1977, Khim. Farm. Zh., 11(10): 38-42; 2000, Chemistry of Heterocyclic 
Compounds, 36(2): 182-184; 1982, Indian J. Chemistry, Sect. B, 21B(10): 928-640. 
Detectable label 13 can be purified if desired by methods known in the art, for 
example, column chromatography or preparative HPLC. The compound can be 
further purified if desired by, for example, lyophilization with water. 



5.10 EXAMPLES: SYNTHESIS OF COMPOUNDS OF THE 
INVENTION 






5.10.1 EXAMPLE 1: SYNTHESIS OF BIOTIN-PEG-AMINE 




A 2.0 L round bottom flask was charged with tetraethyleneglycol 
diamine (7.0 g, 36.4 mmol) (commercially available from Molecular Biosciences; 
Boulder, CO) and dichloromethane (600 ml). To this clear solution was added a 
suspension of N-hydroxysuccinimide ("NHS") biotin (2.73 g, 8.0 mmol) 
(commercially available from Molecular Biosciences) in dichloromethane (100 ml) 
over a period of 50 minutes. The reaction was allowed to stir at room temperature for 
two days. The solvent was removed on the rotary evaporator under reduced pressure 
and the product was purified by column chromatography (70% yield). 

5.10.2 EXAMPLE 2: SYNTHESIS OF FMOC-METHYL-PEG-BIOTIN 



To a solution of FMOC-homoalanine (0.65 g, 2.0 mmol) 
(commercially available from Fluka) and biotin PEG amine (0.56 g, 1.34 mmol) in 
DMSO/DMF (6 ml, 5:1 ratio) was added HBTU in DMF (5 ml, 1.0 mmolar solution) 
with cooling. After 5 minutes of stirring, N,N-diisopropylethylamine (1.0 
ml) was added and the reaction mixture was stirred for 1.0 hr at room temperature. 
The solvent was evaporated under reduced pressure and the product was purified by 
column chromatography (50% yield). 

5.10.3 EXAMPLE 3: SYNTHESIS OF METHYL-PEG-BIOTIN 




The FMOC protecting group was removed from the amine using 
1,8-diazabicyclo[5.4.0]undec-7-ene (DBU). To a solution of FMOC homoalanine 
PEG biotin (0.25 g, 0.345 mmol) in methanol was added DBU (0.2 g, 1.31 mmol) at 
room temperature, and the reaction was stirred overnight. Methanol was removed under 
vacuum and the product was purified by column chromatography (87% yield). 

5.10.4 EXAMPLE 4: SYNTHESIS OF OXANOSINE TRIPHOSPHATE 

Guanosine triphosphate (210 mg, 0.384 mmol, lithium salt) 
(commercially available from Sigma, St. Louis, MO) was dissolved in a solution of 
sodium acetate buffer (2 ml, 12 M, pH 3.7) and sterile water (4 ml) at room 
temperature. To this clear solution was added sodium nitrite (50 mg, 0.724 mmol, 
1.88 equiv) in sterile water (2 ml) at room temperature with gentle stirring, and the 
reaction was left for 24 hours. Analytical HPLC of this reaction showed one major 
(xanthosine) and one minor (oxanosine) peak and no detectable starting material. The 
compound was purified using a preparative column (Zorbax, SB-C18, 21.2 mm x 25 
cm, Part # 880975-102) (commercially available from Agilent Technologies, Palo 
Alto, CA). The collected fractions containing the suspected product were pooled 
together and lyophilized. The purified oxanosine triphosphate was obtained as a 
white solid (42 mg, 21% yield) after two lyophilizations with water. 

5.10.5 EXAMPLE 5: SYNTHESIS OF α-METHYL-PEG-BIOTIN IMIDAZOLE TRIPHOSPHATE 




To a solution of oxanosine triphosphate (15 mg, 0.0286 mmol) in 
sodium borate buffer (12 ml, 250 mM, pH 9.0) was added methyl-PEG-biotin (20 mg, 
0.0398 mmol, 1.39 equiv) in sterile water (1.0 ml). This reaction mixture was kept in 
an incubator at 38° C for 24 hours with gentle movement using a shaker. The reaction 
mixture was analyzed by HPLC and found to contain the desired product, a significant 
amount of oxanosine, and some amine. The compound was purified by RP-HPLC 
(Symmetry Semi prep column, C-18, 7.8 x 300 mm, Part # WAT 066235, from Waters 
Corporation, Milford, MA), and 11 mg was isolated (40% yield). 
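
The molar equivalents quoted in the examples above follow directly from the stated millimole amounts. The following is a minimal sketch of that arithmetic; the function name is illustrative, and the calculation assumes only the quantities given in Examples 1, 4 and 5.

```python
def equivalents(reagent_mmol: float, limiting_mmol: float) -> float:
    """Molar equivalents of a reagent relative to the limiting reactant."""
    if limiting_mmol <= 0:
        raise ValueError("limiting_mmol must be positive")
    return reagent_mmol / limiting_mmol


# Example 1: tetraethyleneglycol diamine (36.4 mmol) vs. NHS biotin (8.0 mmol).
diamine_equiv = equivalents(36.4, 8.0)

# Example 4: sodium nitrite (0.724 mmol) vs. guanosine triphosphate (0.384 mmol).
nitrite_equiv = equivalents(0.724, 0.384)

# Example 5: methyl-PEG-biotin (0.0398 mmol) vs. oxanosine triphosphate (0.0286 mmol).
amine_equiv = equivalents(0.0398, 0.0286)

print(f"diamine: {diamine_equiv:.2f} equiv")
print(f"nitrite: {nitrite_equiv:.2f} equiv")
print(f"amine:   {amine_equiv:.2f} equiv")
```

The computed ratios agree with the stated 1.88 and 1.39 equivalents to within rounding, and show that the diamine in Example 1 is used in roughly 4.5-fold molar excess over the NHS biotin.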

5.10.6 TDT LABELING WITH IMIDAZOLE-PEG-BIOTIN 

The detectable labels of the invention include, but are not limited to, 
non-radioactive ribonucleotide analogs purified by reverse phase HPLC and 
quantified by molecular mass. The detectable labels can be used, for example, as a 
substrate for terminal deoxynucleotidyl transferase (TdT). As an illustrative example, 
imidazole-PEG-biotin can be transferred to the 3' hydroxyl terminus of double or 
single stranded DNA molecules for template independent biotinylation. DNA 
fragments labeled with imidazole-PEG-biotin hybridize efficiently to oligonucleotide 
microarrays and can readily be detected via staining with streptavidin fluorescent dye 
conjugates. 

5.10.7 FRAGMENTATION AND LABELING 

Genomic DNA samples (50 ng in 10 mM TRIS buffer, pH 8.3) were 
mixed with a master mix, MgCl2, and a set of primers appropriate for amplifying a 
desired set of CYP450 polymorphisms. The master mix contained the following 
components: TRIS HCl buffer, pH 8.3; ammonium sulfate; KCl; DNA polymerase; 
uracil-N-glycosylase; and each of the deoxynucleotide triphosphates dATP, dCTP, 
dTTP, dGTP and dUTP. The reaction was subjected to the following thermocycling 
profile: 50° C for 2 minutes, 95° C for 10 minutes, 35 cycles of 95° C for 15 sec and 
67° C for 4 minutes, followed by an incubation of 7 minutes at 72° C, and held at 4° C. 
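
The thermocycling profile above can be written out as data to check the programmed run time. The following is a minimal sketch only; the `Step` class and helper names are illustrative, and the calculation ignores ramp times and the final 4° C hold, which depend on the instrument.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float   # block temperature in degrees Celsius
    seconds: float  # programmed hold time at that temperature


# Profile as stated in the text (the open-ended 4 C hold is omitted).
INITIAL = [Step(50, 2 * 60), Step(95, 10 * 60)]
CYCLE = [Step(95, 15), Step(67, 4 * 60)]
N_CYCLES = 35
FINAL = [Step(72, 7 * 60)]


def expand_profile():
    """Return the full list of steps with the cycling block unrolled."""
    return INITIAL + CYCLE * N_CYCLES + FINAL


def programmed_minutes(steps):
    """Sum of programmed hold times in minutes, excluding ramp times."""
    return sum(step.seconds for step in steps) / 60.0


if __name__ == "__main__":
    profile = expand_profile()
    print(len(profile), "steps,", programmed_minutes(profile), "minutes")
```

Under these assumptions the programmed hold times total 167.75 minutes, most of which is the 35 repetitions of the 4-minute step at 67° C.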

An aliquot of amplicons from the PCR reaction was incubated with a 
mixture of DNase I and phosphatase to cleave the amplicons to 25-100 bp in length 
and to dephosphorylate the dNTPs in the PCR mixture. 
Fragmented amplicons were subsequently labeled with imidazole-PEG-biotin by TdT. 

5.10.7.1 DNASE I FRAGMENTATION REACTION 

PCR reaction products were combined with a DNase fragmentation 
reagent containing DNase I, EDTA, alkaline phosphatase, and water. The mixture 
was incubated at 25° C for 20 minutes, then 95° C for 10 minutes. The fragmented 
amplicons were then held at 4° C prior to initiating the TdT reaction. 

5.10.7.2 TdT LABELING REACTION 

The DNase fragmented amplicons were then incubated with the 
labeling mixture, including TdT buffer (1 M potassium cacodylate, 250 mM Tris 
base, 10 mM CoCl2, and 0.2 mM DTT, pH 7.6), imidazole-PEG-biotin, TdT, and water, 
and stirred at 37° C for 1 hour, then 95° C for 5 minutes. The imidazole-PEG-biotin 
labeled amplicons were then held at 4° C prior to transfer to hybridization solution. 

5.10.8 MICROARRAY HYBRIDIZATION 

The labeled amplicon was subsequently transferred to microarray 
hybridization buffer (SSPE, Triton X-100, standardization oligonucleotide, Denhardt's 
solution, and deionized water). 

The denatured sample was loaded onto a fluidics station for a 30 minute 
hybridization at 45° C, followed by 50° C washes, 10 minute staining with streptavidin 
phycoerythrin, and washes at room temperature. The microarray was scanned by a 
laser to generate fluorescent intensity values that are converted by software into 
genotype calls. Hybridization performance with the PEG-biotin imidazole compound 
was monitored by several metrics: background intensity (BG); chip intensity values, 
namely key perfect match (KeyPM, for key mutation sites) and reference perfect match 
(RefPM); pass rates (key and reference); and discrimination ratios for key mutation 
sites. 

To assess the transfer efficiency and functional performance of 
imidazole-PEG-biotin on microarrays, two clinical samples and a control sample 
were labeled with varying amounts of TdT (200, 400, and 800 units) in the presence 
of 250 µM imidazole-PEG-biotin. 

TABLE 1 

Amount    Sample     KeyPM     RefPM     BG        KeyPass    RefPass 
of TdT               Int       Int                 Rate       Rate 

800 U     Control    [illegible] 
          A          [illegible] 
          B          705.93    559.19    80.81     100 %      92.0 % 
400 U     Control    689.06    584.20    82.33     100 %      91.9 % 
          A          926.92    812.97    82.20     97.2 %     92.9 % 
          B          637.59    517.60    81.61     97.2 %     92.0 % 
200 U     Control    412.43    355.46    80.02     85.2 %     75.3 % 
          A          824.73    723.47    83.45     95.4 %     91.0 % 
          B          499.68    414.32    104.86    98.1 %     91.4 % 

The control is a genomic DNA sample commercially available from Coriell Cell 
Repository, reference # GM 09912. 

Samples A and B are clinical samples of CYP2Dss26 gene duplication. 

Table 1 shows the efficiency of labeling nucleosides with imidazole 
PEG biotin labels. Particularly, Table 1 illustrates the efficiency of imidazole PEG 
biotin labels in detection of CYP450 polymorphisms. 
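
The trend in Table 1 can be summarized numerically from the rows that are legible. The following is a rough sketch only: it uses just the seven legible rows, and both the per-amount mean and the KeyPM-to-BG ratio are illustrative summaries introduced here, not metrics defined in this description.

```python
from statistics import mean

# (TdT units, sample, KeyPM intensity, RefPM intensity, BG) for legible rows only.
ROWS = [
    (800, "B", 705.93, 559.19, 80.81),
    (400, "Control", 689.06, 584.20, 82.33),
    (400, "A", 926.92, 812.97, 82.20),
    (400, "B", 637.59, 517.60, 81.61),
    (200, "Control", 412.43, 355.46, 80.02),
    (200, "A", 824.73, 723.47, 83.45),
    (200, "B", 499.68, 414.32, 104.86),
]


def mean_keypm_by_units(rows):
    """Average KeyPM intensity for each TdT amount (hypothetical summary)."""
    grouped = {}
    for units, _sample, key_pm, _ref_pm, _bg in rows:
        grouped.setdefault(units, []).append(key_pm)
    return {units: mean(values) for units, values in grouped.items()}


def keypm_to_background(rows, units, sample):
    """KeyPM divided by BG for one row (an illustrative ratio)."""
    for row_units, row_sample, key_pm, _ref_pm, bg in rows:
        if row_units == units and row_sample == sample:
            return key_pm / bg
    raise KeyError((units, sample))


if __name__ == "__main__":
    for units, value in sorted(mean_keypm_by_units(ROWS).items()):
        print(f"{units} U: mean KeyPM = {value:.2f}")
```

On the legible rows, the mean KeyPM intensity is higher at 400 units of TdT than at 200 units, although with so few samples this is indicative only.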

The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in 
addition to those described herein will become apparent to those skilled in the art 
from the foregoing description. Such modifications are intended to fall within the 
scope of the appended claims. 

Various publications, patents and patent applications are cited herein, 
the disclosures of which are incorporated by reference in their entireties. 